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We develop a new method of combining cluster observables (number counts and cluster-cluster 
correlation functions) and stacked weak lensing signals of background galaxy shapes, both of which 
are available in a wide-field optical imaging survey. Assuming that the clusters have secure redshift 
estimates, we show that the joint experiment enables a self-calibration of important systematic 
errors inherent in these measurements, including the source redshift uncertainty and the cluster 
mass-observable relation, by adopting a single population of background source galaxies for the 
lensing analysis. The single source galaxy population allows us to use the relative strengths of the 
stacked lensing signals at different cluster redshifts for calibrating the source redshift uncertainty, 
which in turn leads to accurate measurements of the mean cluster mass in each redshift and mass 
bin. In addition, our formulation of the stacked lensing signals in Fourier space simplifies the Fisher 
matrix calculations, as well as the marginalization over the cluster off-centering effect which is one of 
the most significant uncertainties in the stacked lensing analysis. We show that upcoming wide-field 
surveys covering more than a few thousand square degrees yield stringent constraints on cosmological 
parameters including dark energy parameters, without any priors on nuisance parameters that model 
systematic uncertainties. Specifically, the stacked lensing information improves the dark energy 
figure of merit (FoM) by a factor of 4, compared to that from the cluster observables alone. The 
primordial non-Gaussianity parameter can also be constrained with a level of (t(/nl) ~ 10. In this 
method, the mean source redshift is well calibrated to an accuracy of 0.1 in redshift, and the mean 
cluster mass in each bin to 5-10% accuracies, which demonstrates the success of the self-calibration 
of systematic uncertainties from the joint experiment. 



I. INTRODUCTION 

The abundance and its evolution of clusters of galax- 
ies are thought to be one of main probes of cosmology 
including dark energy (e.g., [Tj]). This is because dark 
matter plays an essential role in the formation processes, 
indicating that the abundance and clustering properties 
of clusters can be predicted reasonably well by theory. In 
addition, the redshift evolutions of the cluster abundance 
and their clustering amplitudes are sensitive to the linear 
growth rate, which in turn is sensitive to the expansion 
history of the universe. In fact recent careful analyses of 
X-ray @, [j| and optical [H clusters have demonstrated 
that cluster abundance can provide useful constraints on 
cosmological parameters. In addition, clusters serve as 
powerful tests of non-standard cosmological scenarios in- 
cluding the modified theory of gravity [5|-|7| and primor- 
dial non-Gaussianity [8t-ll2j. 

Perhaps the most significant challenge in cluster cos- 
mology lies in the determination of masses for individual 
clusters or for a cluster sample as a whole. In many cases, 
we resort to mass-observable scaling relations, which have 
been calibrated by the intensive observations of (a sub- 
sample of) the clusters, in order to infer clusters masses. 
Popular choices of such observables include X-ray lumi- 
nosities [l3l-[l5j|. X-ray temperatures [l6l - [l9| , gas masses 
[20l | . optical richnesses [2l|, the Sunyaev-Zel'dovich (SZ) 
effect (the integrated gas pressure along the line-of-sight) 
[22I [23] and its X-ray analog [2~4Tj2rl |. However, it is not 



an easy task to derive cluster masses observationally for 
calibrations, and moreover, we have to take account of 
its redshift evolution for robust cosmological studies. 

Weak gravitational lensing provides a very powerful 
tool for measuring cluster masses. Gravitational lens- 
ing induces coherent tangential distortions of background 
galaxy shapes around a cluster, from which we can di- 
rectly measure total mass profiles of the cluster (see [27j 
for a review). While accurate mass determinations with 
weak lensing are feasible only for massive clusters, by 
stacking signals from many clusters we can measure aver- 
age masses of clusters down to less massive halos 0, [28l - 
|33| . This stacked lensing technique has indeed been ap- 
plied to calibrate the mass-observable relation to derive 
interesting constraints on cosmological parameters (e.g., 
U). However, the stacked lensing analysis involves vari- 
ous systematic uncertainties. For instance, all weak lens- 
ing analysis is subject to the uncertainty in the redshift 
distribution of source galaxies used in the lensing anal- 
ysis. In cosmic shear analysis, the estimation of source 
redshifts sometimes becomes the most significant source 
of systematic errors that challenge the use of weak lensing 
for cosmological studies (33436j . In addition, one of the 
most significant uncertainties in stacked lensing analysis 
would be cluster centroiding (off-centering) errors which 
arises from the limitation of the mass centroid determi- 
nation from available data [2l|, |37} • 

Clusters of galaxies are biased tracers of the underly- 
ing mass distribution. Thus one can obtain constraints 
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on the growth of structure from spatial clustering of mas- 
sive clusters as well. Furthermore, the bias contains infor- 
mation on the cluster mass, which offers another oppor- 
tunity to infer cluster masses in a statistical way. This 
opens up a possibility to calibrate the mass-observable 
relation from observing data themselves [38l - l40l |. 

In this paper, we explore the potential of future optical 
cluster surveys, such as the Hyper Suprime-Cam (HSC) 
[ilj . Dark Energy Survey (PES) [Hj], and Large Synoptic 
Survey Telescope (LSST) [43| , for constraining cosmolog- 
ical parameters. In these surveys, many clusters out to 
high-redshift (z > 1) will be identified in multi-color op- 
tical images, or with aid of Sunyaev-Zel'dovich (SZ) ef- 
fects observed by Atacama Cosmology Telescope (ACT) 
0IH, South Pole Telescope (SPT) g|, or the all-sky 
Planck survey jlOC| . We combine number counts of these 
clusters with stacked weak lensing measurements, i.e., 
tangential shear profiles around the clusters taken in the 
stacking average. At small-scale the shear profiles con- 
strain mean masses of cluster samples, whereas large- 
angle tangential shear profiles serve as a clean probe of 
halo-mass cross power spectrum. The cluster number 
counts and tangential shear profiles respectively contain 
important systematics, the mass-observable relation and 
the source redshift uncertainty. We show how the com- 
bination of these two observables can help self-calibrate 
these systematics to obtain robust constraints on cosmo- 
logical parameters. 

This paper is organized as follows. In Sec. [Til we sketch 
out the basic idea of our technique, focusing on how we 
can overcome various difficulties inherent in cluster cos- 
mology using the stacked weak lensing. We calculate 
the signals in Sec. IIII1 and the covariance between ob- 
servables in Sec. IIVI We show results on the cosmo- 
logical parameter forecast in Sec. [V] We discuss pos- 
sible systematics in Sec. I VII and conclude in Sec. IVIII 
We choose the best-fit cosmological parameters from 
the seven-year Wilkinson Microwave Anisotropy Probe 
(WMAP) observations 47] as our fiducial cosmological 
model, with the matter density Qm = 0.266, baryon den- 
sity fi;, = 0.04479, the dark energy density J7de = 0.734 
and its equation of state w(a) — w + (1 — a)w a = —1, 
the dimensionless Hubble constant h — 0.710, the power 
spectrum tilt n s — 0.963, and the normalization of the 
power spectrum as = 0.801. Throughout the paper we 
assume a flat universe, i.e., Qm + ^de = 1, and use the 
unit c = 1 for the speed of light. 



II. BASIC IDEAS 

The lensing distortion signal for a source galaxy in a 
particular direction on the sky and at redshift z s is 
expressed as 



7(0; z s ) = AttG I dz a%(z) 
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where x — xi z ) is the comoving angular diameter dis- 
tance to redshift z, which is same as the comoving radial 
distance for a flat universe, AE denotes the mass distri- 
bution field along the line of sight, and the redshift is 
related to the scale factor a via 1 + z = 1/a. 

One of the most serious systematic errors in weak lens- 
ing is the photometric redshift error, i.e., uncertainty in 
estimating redshifts of source galaxies. Since a spectro- 
scopic survey of all the source galaxies is observationally 
too expensive, in most cases redshifts of source galaxies 
have to be inferred based on their broadband photome- 
tries, the so-called photo- z information. For cosmic shear 
tomography [48l - l50{ . which is potentially the most pow- 
erful cosmological probe, future surveys need to calibrate 
mean redshifts of each tomographic bins to about 0.1% 
in order for constraints on cosmological parameters not 
to be significantly degraded [lij . This level of calibration 
is indeed very challenging. 

In this paper, we propose a method which adopts the 
stacked lensing technique for the cosmological purpose, 
yet is insensitive to the photo- z errors. The key idea to 
overcome the source redshift uncertainty is to use a single 
population of background source galaxies to extract the 
lensing information, where source galaxies can be selected 
based on the photo-z information, in order to control and 
to calibrate its uncertainty. 

More specifically, we consider the following situation. 
We study shear signals around a catalog of galaxy clus- 
ters, whose redshifts are assumed to be known either 
spectroscopically or photometrically. The photometric 
redshift estimate of each cluster is much more robust 
and accurate than photo-z estimates of individual galax- 
ies, because photo-z estimates are more secure for early- 
type galaxies and photo-z uncertainties are statistically 
reduced by combining many member galaxies. We as- 
sume that redshifts of source galaxies are not overlapped 
with the cluster redshifts, i.e., all the source galaxies are 
located behind any clusters in the catalog. By cross- 
correlating the shapes of source galaxies with the cluster 
distribution in a particular redshift slice centered at zi, 
we can measure the shear-cluster cross-correlation (or the 
stacked lensing signal) as a function of separation angles 
between the cluster center and the source galaxies: 
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The redshift z Sjm i n denotes the minimum redshift used 
to define a source galaxy sample. The function 
(AE(xz#, zi)) denotes the average mass profile around the 
lensing clusters at the redshift z;, and dp/dz s is the nor- 
malized redshift distribution of source galaxies defined so 
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FIG. 1: A conceptual flow-chart of the method studied in this paper. We propose a joint experiment of combining the cluster 
observables and the shear-cluster correlation (or the so-called stacked lensing) to constrain cosmological parameters including 
dark energy parameters. The most serious systematic error inherent in each observable, the source redshift uncertainty and 
the mass-observable relation, can be calibrated out by combining the two observables (also see text around Eq. [2]). The 
cosmological power of this method arises mainly from the large-angle signals of the shear-cluster cross-correlation (2-halo term) 
and from the cluster observables. In addition, we consider how other systematic errors such as variations in the halo profile 
and the cluster centering offset affect cosmological constraints. 



as to satisfy dz s dp/ dz s = 1. Thus the dependence of 
the stacked lensing on source redshifts is only via a sin- 
gle quantity, the average (x(zs)" 1 ) over the source galaxy 
population, and its effect is just to cause an offset in the 
overall amplitude of the stacked shear profile. 

Eq. ^ indicates that the stacked lensing method in 
principle allows one to extract the lensing contribution 
at each cluster redshift from the total shear signals in 
source galaxy images. Thus, if cluster samples at differ- 
ent redshift slices are available as we study below, we can 
measure the redshift evolution of the stacked lensing sig- 
nals, which provides a clue to the tomography of shear 
signals, i.e., how the shear signals in source galaxy im- 
ages are built up over structures along the line of sight. 
A key in this step is that the stacked lensing signals 
at different redshifts all depend on the single quantity 
(xizs)' 1 )- Hence, even if the mean redshift of source 
galaxies is completely unknown as an extreme case, the 
quantity (x(^ s )" 1 ) can be self-calibrated by combining all 
the stacked lensing signals, as long as the stacked lensing 
signals at different cluster redshifts do not carry perfectly 
degenerate information. 

On the other hand, the cluster observables (number 
counts of clusters and the cluster-cluster power spec- 
trum) also involve several systematic uncertainties. The 
most important one is the uncertainty in the mass- 
observable relations. While the theoretical predictions 
for cluster observables are primarily given as a function of 
halo masses, the halo masses need to be inferred from the 
cluster observables, which causes uncertainties because 
of the imperfect relation between cluster observables and 
halo masses. Although previous studies have considered 
a possibility of calibrating the mass-observable relations 
by using clustering information of clusters (e.g., [HI), 
in our case cluster masses in each bin can directly be 
calibrated based on the stacked lensing signals at small 



angular scales, which simply measures the mean mass 
profile of clusters (the l-halo term). 

Thus the weak lensing signals of distant galaxies and 
the cluster catalogs offer a promising synergy. Fig. [1] 
shows a conceptual flow-chart of the method studied in 
this paper. The combination of the two observables, the 
stacked lensing and the cluster observables, enables the 
self-calibration of the two major systematic errors, the 
source redshift uncertainty and the mass-observable re- 
lation. The cosmological power arises mainly from the 
large-angle signals of the stacked lensing as well as from 
the cluster observables, where both the observables are 
given as a function of different bins of cluster mass in- 
dicators (e.g., richnesses) and redshifts. In addition, we 
will carefully address other systematic errors that affect 
the lensing observables, such as variations in the cluster 
mass profile, the offset of cluster centers, and a possible 
multiplicative uncertainty in estimating the lensing shear 
amplitude from galaxy shapes. 

For a further clarity, the setup of the method is illus- 
trated in Fig. [SJ Unlike the conventional weak lensing 
tomography, we consider a single population of source 
galaxies defined so as not to be overlapped with clus- 
ters. We assume that redshifts of the clusters can be 
measured accurately, and clusters are divided into red- 
shift bins. Since all cluster samples in different redshift 
bins share exactly the same source population, the bias in 
the source redshift distribution affects the shear signals 
in different redshift bins coherently in a way easily pre- 
dicted by theory, which is the reason the source redshift 
uncertainty can be calibrated. 

The self-calibration of the source redshift uncertainty 
and the mass-observable relations is feasible only when 
these two uncertainties are not completely degenerate. 
We check this point in Fig. [3J By increasing the 
source galaxy redshift, the stacked lensing signal at scales 
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FIG. 2: The setup assumed in the paper. The population of 
source galaxies for weak lensing analysis is assumed to be lo- 
cated behind any clusters used for cosmological analysis (top). 
We study the cluster-shear cross correlation (whose signal is 
proportional to the lensing weight function W K (z); middle) 
and the number of clusters as a function of cluster redshifts 
(bottom). 
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FIG. 3: Upper panel: The sensitivity of the tangential shear 
(y+)(9) at 9 = 10' (roughly corresponding to the virial ra- 
dius for a cluster with the mass M = 10 15 h~ x Mq), as a 
function of the cluster redshift z. We show the fractional 
change of 7+ by changing the mean redshift of the source 
galaxy distribution z m (solid) and the dark energy equation 
of state wo (dotted) and w a (dashed). See Sec. IIHI for details 
of the setup and calculations. Lower panel: The change of 
(7+}(6>) for different 9, when the mass of the cluster is shifted 
by (dln(7+)(10')/dz m ) x 0.1 in order to mimic the fractional 
change due to Az m =0.1 (shown in the upper panel). The 
solid curve shows the fractional change of (7+)(#) at 9 — 10', 
which indeed mimics the slid line in the top panel, wheres the 
dotted curve indicates the change at 9 = 1'. At 9 ^> 10' the 
stacked lensing signal is dominated by the 2-halo term, whose 
mass dependence is simply the halo bias bh(M); the fractional 
change of bh (M) is shown by the dashed curve. 



around the virial radius, 10' considered here, increases 
monotonically with increasing cluster redshift. Although 
we can shift the mass of the cluster in a way to mimic 
this trend, the different mass dependence on the tangen- 
tial shear at different radii suggests that we can break 
the degeneracy between the source redshift and the mass- 
observable relation by observing tangential shear signals 
over a wide range of radii. Furthermore, the trend is 
quite different from the changes of the tangential shear 
due to different dark energy equation of state, suggest- 
ing that these uncertainties are not degenerate with the 
uncertainty of cosmological parameters either. 



III. COSMOLOGICAL OBSERVABLES 

A. A Flat Cold Dark Matter Model 

Throughout this paper we work in the context within 
a spatially flat universe. The Hubble expansion rate, 
H(z) = (da/dt)/a, is given by the scale factor a(t) and 
is given in terms of energy density parameters as 

H 2 (a) = Hi \n M a- 3 + n^e^ ^ da ' {1+w{a ' ))/a '\ , (4) 
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where Hq = lOO/i km s~ x Mpc -1 is the present-day Hub- 
ble parameter. The parameter w(a) specifies the equa- 
tion of state for dark energy as w(a) = pde(o) / Pbe(o-) ■ 
Note that J7m + ^de = 1 and we have used the conven- 
tion a(to) = 1 today. The comoving angular diameter 
distance \{ z ) is given as (note 1 + z = l/o), 



x{z) = l dz 'w^y 



(5) 



which is related to the angular diameter distance Da(z) 
via x(z) = (1 + z)D A (z). 

Throughout this paper we adopt a parametrized form 
of the dark energy equation of state. w(a) = Wq + (1 — 
a)w a [5lj | . In this case, the Hubble expansion rate can be 
described as 




n 



M a 



H 2 (a) = H\ 

(6) 

Another important ingredient is the density fluctua- 
tion and its redshift evolution. In linear theory after 
matter-radiation equality, all Fourier modes of the mass 
density perturbation, 8(x)(= Sp m (x)/p m ), grow at the 
same rate, the growth rate. For general dark energy 
equation of state w (a), the growth rate D(a) can be ob- 
tained by solving the following differential equation for 
G{a) = D{a)/a 



d 2 G 
din a 2 



-w(a)n DE (a) 



dG 
din a 



+- [1 - w(a)} fi DE (a)G = 0, (7) 



with the boundary condition G(a) = 1 and dG/dhia = 
at a <C 1. The linear density fluctuation is characterized 
by the linear power spectrum P^(k; z), which is related 
with the primordial curvature fluctuation as 53] 
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(8) 



where T(k) is the transfer function of matter perturba- 
tions with baryon oscillations [HJ], 8^ is the normaliza- 
tion of the fluctuation at ko = 0.002 Mpc -1 following 
the convention in [55| , and a s is the running index of the 
primordial power spectrum. We adopt no running, i.e., 
a, =0 as our fiducial choice. 



FIG. 4: The predicted number distributions dN/dz of clus- 
ters for the survey area of 2000 deg 2 . Here we consider 
three mass thresholds, M obs > W 14 h~ 1 M El , 1O 14 ' 5 /i _1 M , 
and 10 h~ Mq, where the observed mass M b s is related 
with the true mass via Eq. ((9|. Boxes indicate the binned 
uncertainties (see Sec. IIV A|) for the bin size of Az — 0.05. 



1. Number Counts 

The number counts of clusters in a given mass and 
redshift bin is the most fundamental quantity in cluster 
cosmology. The cosmological power of cluster number 
counts arises from their exponential sensitivity to the am- 
plitude of the initial density perturbations. However, to 
implement this experiment, the total mass of each clus- 
ter, which is dominated by dark matter, has to be inferred 
from available observables such as lensing, member galax- 
ies. X-ray and the SZ effect. We follow the previous work 
[40j | to model the mass-observable relation including un- 
certainties in the mass inference from available data. The 
model relation assumes that the probability of obtaining 
the mass inferred from observables, Mobs, for a cluster 
with the true mass of M can be represented by the log- 
normal distribution: 



p(M oha \M) 



' exp [-x 2 (A/ obs )] ' 



2na\ n 



M 



obs 



(9) 



where 



lnM obs -lnM-lnM blas 
x{M ohs ) = 7= . (10) 
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B. Cluster Cosmology 

The galaxy cluster observables we will consider in this 
paper are the number counts and the cluster power spec- 
trum which can both be measured from a given survey 
region. In this section we briefly review the basics of the 
cluster observables. 



Unless otherwise specified, we assume lnMbias = and 
cinM = 0.3, though uncertainties of the mass variance 
a \nM ano - t ne mass bias lnMbi as , including their mass 
and redshift dependences, are fully taken into account in 
cosmological parameter forecasts. 

The average number density of clusters that lie within 
the i-th redshift bin 2^ m ; n < z < Zj )IQ ax and the 6-th 
mass bin defined in terms of the observed mass, Mf,. m ; n < 
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M obs < M b 

.max- 

is given by 

d 2 V [ M >>™* 

niib)= LJ z ^L^ dMohs 

/dfi 
dM—p{M A ,\M) 

where d 2 V/dzdVL = x 2 /H(z) denotes the comoving vol- 
ume element per unit redshift and per unit steradian, 
and Si(h) (M; z) is the selection function in the z-th red- 
shift bin and the 6-th mass bin, defined by 

S l ( b )(M;z) = Q(z - Zi !min )Q(z itmax - z) 

X-[eiic{x(M bimiD )} - erfc{x(M b , max )}] ,(12) 

where Q(x) is the Heaviside step function and erfc(x) is 
the complementary error function. The redshift depen- 
dence of (M) may come from a possible variation of 
the minimum and maximum mass with redshift, and also 
from a possible redshift evolution of In Mbi as and cr\ n m as 
considered later. In the following we will use the indices 
and b,b', . . . to denote the redshift bins and the 
mass bins, respectively, for notational convenience. 

The function dn/dM in Eq. (fTTj) denotes the the halo 
mass function, for which we employ a fitting formula pre- 
sented by (56|. When we include the local-type primor- 
dial non-Gaussianity parametrized by /nl, we employ the 
same method as adopted in [9| to compute the modified 
mass function. 

For a given survey area of fl s the mean number counts 
of clusters in the redshift and mass bin is given by 

Ni( b ) = fi s ni( 6 ). ( 13 ) 

We show in Fig. @] how the number counts of clusters 
scale with redshift and mass threshold. 



2. Cluster- Cluster Power Spectrum 

Another useful cluster observable for cosmology is the 
cluster-cluster correlation function. While clusters are bi- 
ased tracers of the underlying mass distribution, the clus- 
ter bias is, in contrast to the galaxy bias, well described 
by the halo bias. As in the case of the halo mass func- 
tion, the halo bias is accurately predictable for a given 
cosmological model, e.g., based on a suite of A-body 
simulations, and is sensitive to halo mass. For this rea- 
son, combining the cluster number counts with the clus- 
ter correlation function can provide a promising way for 
self-calibrating the mass-observable relation [jj [TJ |38| — 
l40j . Once the mass-observable relation is sufficiently 
calibrated, the cluster correlation function itself carries 
clean cosmological information in the sense that the in- 
formation is mainly from the linear or weakly nonlinear 
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FIG. 5: The cluster-cluster angular power spectrum for clus- 
ters with M obs > 1O 14 /i _1 M and 0.7 < z < 0.8. Boxes 
indicate the binned uncertainties (see Sec. IIVB|I for the bin 
size of 0.1 dex, assuming the survey area of 2000 deg 2 . 



regimes. Furthermore, if the primordial non-Gaussianity 
exists, it induces characteristic scale-dependent features 
on the halo bias at large length scales j8j, which can be 
efficiently extracted by measuring the cluster correlation 
function. 

In this paper we use the Fourier-counterpart of the 
cluster correlation function, the cluster power spectrum, 
for parameter forecasts. Using the Limber's approxima- 
tion [57j], we can compute the angular power spectrum 
between pairs of clusters, which are in the b- and 6'-th 
mass bins, respectively, and at the same i-th redshift bin: 

Cf W )W = jdxWf {b) {z)wf {V) {z) x - 2 P^ (k = -;«), 

(14) 

where P^(k; z) is the linear mass power spectrum at red- 
shift z (Eq. [8]) j and is the weight function defined 

as 

1 d 2 V T dn 

(15) 

Here d 2 V/dxd£l = x 2 an d bh(M; z) is the bias parameter 
for halos with mass M and at redshift z, which should 
not be confused with the mass bin index (b). Again, we 
adopt a fitting formula presented by (5(| in computing 
bh(M;z). When we include the effect of the primordial 
non-Gaussianity on the halo bias, we employ the method 

IH3. 

Fig. [5] shows an example of the angular power spec- 
trum, for clusters in a redshift slice of 0.7 < z < 0.8 and 
with observed masses M bs > 10 14 /i- 1 M Q . The boxes 
around the curve show the expected measurement errors 
for our fiducial survey (see below for details). 
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C. Weak Lensing Observables 

1. Weak Lensing Basics 

Due to the geometrical nature of gravitational lensing, 
the lensing shear amplitude scales with the critical sur- 
face density £ cr it for a given population of source galaxies 



£^ t (z) = 47rGx(*)(l+*)- 



1 - X(z) 



1 



, (16) 



where (x(z s ) _1 ) is the inverse distance averaged over 
the source galaxy population defined by z s > z S! mi n (see 
Eq. [3]). Since we are interested in the statistical weak 
lensing, we here defined the mean critical surface den- 
sity for a lensing object at redshift z, averaged over the 
redshift distribution of source galaxies as in Eq. @. 

To compute the lensing efficiency, we need to spec- 
ify a population of source galaxies. We simply assume 
that photometric galaxies follow the redshift distribution 
given by 



dp 

dz. 



2z 3 



exp 



(17) 



where zq is free parameter, and dp/dz s satisfies the nor- 
malization condition J^°dz s dp/dz s = 1. With this form, 
the mean redshift z m is given as z rn = (z s ) = 3zq. As our 
fiducial choice, we assume zq = 1/3 corresponding to the 
mean redshift z m = 1, which roughly mimics the depth 
of the Subaru Hyper Suprime-cam (HSC) Survey. 

For the method studied in this paper, we focus on a 
sub-population of source galaxies for extracting the lens- 
ing signals (see also Sec. QT] for discussions). The source 
galaxies are defined from galaxies at redshifts greater 
than the minimum redshift z s ,min- Our fiducial choice 
is z s ,min = 1-5 such that source galaxies are behind all 
clusters of interest. Thus the source redshift dependence 
of the lensing efficiency is only via the single quantity, 
(x^ 1 )- We will employ the mean source redshift of the 
whole galaxy distribution (i.e., not the mean redshift of 
the sub-population defined by z s > z s ,min), z rn , to de- 
scribe the source redshift dependence in making param- 
eter forecasts. 

The lensing shearing effect on source galaxy images 
is caused by the foreground structure along the line of 
sight. The shear amplitude for a given galaxy in the 
angular direction 6 on the sky is closely related to the 
wei ghte d mass density field along the line of sight (e.g., 
see (13): 



?(0)<-«(0) = Jd X W K (z)S m ( X 6, X ), 



(18) 



where the vector notation 7 is used to explicitly show that 
the shear field is a spin-2 field, i.e., has two components 
at each position on the sky, k is the so-called convergence 
field, and <5 m (x#, X ) is t ne mass density fluctuation field. 



Note that lensing does not occur for a completely homo- 
geneous universe with S m = 0. The function W K {z) is 
the lensing weight function defined by 



W K (z) 



Prn(z) 



(1 + z)Ecrit(z)' 



(19) 



where p m (z) denotes the mean mass density of the uni- 
verse at redshift z. 

Using the Limber's approximation, we can compute 
the lensing power spectrum for the source galaxy popu- 
lation defined above as 

C™(£) = Jd X [W K (zf X - 2 P^[k = ^ z y (20) 

where P^ L (k) denotes the nonlinear mass power spec- 
trum. We use the fitting formula in [59] for computing 
P r ^ L (fc) for a given cosmological model. 



2. Shear- Cluster Power Spectrum: Stacked Lensing 

As briefly described in Sec. [TT1 if a catalog of galaxy 
clusters in the similar redshift range is available, the con- 
tribution of the clusters to the lensing signals of back- 
ground source galaxies can be extracted by measuring 
a cross-correlation between the cluster distribution and 
the shapes of source galaxies - the so-called shear-cluster 
correlation or the stacked lensing. For the limit that a 
redshift slice of clusters, centered at zi, is sufficiently thin, 
the shear-cluster correlation is given by 

( 7+ ) (*;*,) ee J- (n cX {e'-z l)l+ {e e')) = A Y x f\ Zl) , 

n c i Ecrit(zz) 



(21) 

where X i = x( z i) and A£ m (x/#;z/) is the radial profile 
of the projected mass density around the cluster center. 
Observationally the shear-cluster correlation can be mea- 
sured by averaging the tangential ellipticity component 
of source galaxy images in each radial bins over all the 
pairs of background galaxies and clusters in the sample, 
where the tangential ellipticity is defined with resp ect to 
the cluster center in each cluster region [!(| H3, [HJ] . Note 
that the cluster distribution is treated as discrete, point 
distribution, and a representative point for each cluster 
needs to be chosen for each cluster, the cluster center in 
this method [60| . 

There are notable advantages in the stacked lensing 
technique, e.g., compared to cosmic shear. (1) As dis- 
cussed at length in Sec. [Ill the source redshift uncertainty 
can be calibrated by combining the stacked lensing sig- 
nals of different cluster redshifts. (2) The stacked lens- 
ing profile is less sensitive to substructures within and 
asphericity of the individual clusters and also to uncor- 
rclatcd large-scale structure along the same line of sight. 
Since the signal probed is linear in shear, which can be 
both positive and negative due to its spin-2 field na- 
ture, these contaminating effects are averaged out via 
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the stacking, if a sufficient number of clusters are avail- 
able and as long as the universe is statistically isotropic 
and homogeneous. As a result, the stacked lensing tech- 
nique probes the average mass distribution around the 
clusters, which can be considered as a spherically sym- 
metric profile. Therefore the stacked lensing profile is 
one-dimensional, i.e., given as a function of radius from 
the cluster center. (3) The stacking boosts the signal to 
noise ratios of shear signals at very large radii, where the 
shear detection is limited by too small signal for an in- 
dividual cluster. The large-angle shear signal is in the 
linear or weakly nonlinear regime, and hence carries rel- 
atively clean cosmological signals. (4) If source galaxies 
are not overlapped with cluster redshifts, another serious 
contamination to cosmic shear, the intrinsic ellipticity 
alignment [6l|, does not affect the stacked lensing sig- 
nals. 

Assuming the fiat-sky approximation, we can express 
the stacked lensing profile in terms of the angular power 
spectrum C*^ (£) as 

f fiif 

<7+> i( 6)(0) = J -^CfaWUlQ), (22) 

where J2 (x) is the second-order Bessel function. The full- 
sky expression can be recently formulated in [62(, where 
it was shown that the effect of the celestial sphere causes 
only a few percent change in the shear amplitude at very 
large angles greater than 10 degrees. We do not think 
that the inaccuracy of the flat-sky approximation largely 
changes the following results, but a more rigorous calcu- 
lation will be our future work. The angular power spec- 
trum contains the equivalent information to the stacked 
lensing profile. Hence we will use the power spectrum 
in conducing parameter forecasts for computational sim- 
plicity. For example, the covariance calculation is easier 
in Fourier space than in real space. 

The stacked shear profile arises from two contributions. 
The shear signal at angular scales smaller than a typical 
projected virial radius of clusters arises from the average 
mass distribution within one halo, the so-called 1-halo 
term, while the large-angle shear arises from the average 
large-scale structure surrounding the clusters, the 2-halo 
term. In what follows, we model the shear-cluster corre- 
lation function based on the halo model. 



3. Stacked Lensing: 1-Halo Term 

As described above, we need a model of the radial 
mass profile of halo in order to compute the 1-halo term 
at small angular scales. In this paper, we employ the 
Navarro-Frenk- White (NFW) profile 63] as used in [60l |. 
The NFW profile is defined by 



p(r) 



(r/r s )(l+r/r s ) 2 



(23) 



The density parameter p s is specified by imposing that 
the mass enclosed within a sphere of the virial radius is 



equal to the virial mass M 
_ A{z)p m (z)c 3 



M 



3m nfw (c) 47rrfm nfw (c) ' 



(24) 



where 



m nfw (c) = / j^—^dr = ln(l + c) - — °— . (25) 



(i + O 2 



1 + c 



Here c = r v j r /r s is a concentration parameter, A(z) is a 
nonlinear overdensity that defines the virial mas based on 
the spherical collapse model (we adopt a fitting formula 
by [64lp. Motivated by the result of iV-body simulations 
(e.g., 65]), we adopt the following model for the mass 
and redshift dependence of the concentration parameter 



c(M, z) = A, 



M 



2 x I0 l2 h- 1 M, 



(1 



(26) 



Here A v i r , B v i r and C V i r are free parameters, and we will 
employ (A vil , B vir , C vir )=(7.85, -0.081, -0.71) as our 
fiducial parameter set according to the result in [66j . 

Using the method developed in [32| (also see (s3|), the 
Limber's approximation and the flat-sky approximation, 
we can derive the power spectrum for the 1-halo term of 
shear-cluster correlation between the source galaxies and 
the clusters in the i-th redshift slice and the 6-th mass 
bin. We find that the 1-halo term is described by 



Kb) 



dz 



d 2 V 
dzdfl 



dn 



x I dM—S m (M;z)R M (£;z), (27) 



where Rm(£) denotes the two-dimensional Fourier trans- 
form of the projected NFW profile (i.e., convergence 
k(9)) defined as 



k M (£;z) 



2ir9d9K{9)J Q (£9) 



Mu M (k = e/x;z) 



(l+z)-VScritW 

(28) 

Here Jo (a;) is the zeroth order Bessel function and Ujvf(fc) 
denotes the three-dimensional Fourier transform of the 
normalized NFW profile umJt) = p(r)/M, which has 
the following analytic form [67J 



u M (k) 



1 



m n f w (c) 



[sina;{Si[:r(l + c)] - Si(x)} 



+ cos x {Ci[a;(l + c)} - Ci(x)} - 



sin(xc) 



x(l + c) 



,(29) 



where x = (l+z)kr s , and Si(x) and Ci(x) are sine and co- 
sine integrals. The relation between the two-dimensional 
and three-dimensional Fourier transforms of halo profile 
was also discussed in Sec. 3.2 in 68 1. 
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4- Effect of Halo Centering Offset 

For the derivation of the 1-halo term above, we implic- 
itly assume that the stacking can be done by properly 
selecting the halo center of each cluster, e.g., the Fourier 
transform of halo profile, um(&) (Eq. [29]), is defined with 
respect to the halo center. On individual cluster basis, a 
dark matter halo generally contains many substructures, 
has a non-spherically symmetric mass distribution, and 
has no clear boundary with its surrounding structures, 
all of which reflect the collision-less nature of dark mat- 
ter. Such a complexity, as well as invisible nature of dark 
matter, implies that it is not straightforward to define the 
halo center for each cluster. Observationally the cluster 
center will be chosen from either/both of the position of 
brightest cluster galaxy (BCG) or the peak of the X-ray 
or SZ maps. If the significant strong/weak lensing detec- 
tion is achieved, albeit rare, the cluster center can be in- 
ferred from the mass distribution reconstructed from the 
lensing. Recently we [69| found that, from the detailed 
weak lensing analysis of individual clusters, the BCG 
position is indeed close to the center inferred from the 
lensing-reconstructed mass distribution, although there 
are some clusters whose centers appear to be significantly 
offseted from their BCGs. In what follows, we develop a 
formulation to account for the effect of possible centering 
offset on the stacked lensing. 

As derived in detail in Appendix [X] the halo centering 
offset acts as a smoothing effect on the stacked lensing 
signal. For a spherically symmetric halo (in an average 
sense) , if the halo center is incorrectly chosen with an off- 
set angle o ff from the true center, the measured stacked 
lensing profile in Fourier space is modified as 



(30) 



In this paper, we are interested in the statistical aver- 
age of tangential shears around many different clusters. 
Naturally all clusters do not share the same offset an- 
gle, but rather the offset should follow some probability 
distribution 



KM, s{i) = km(£) / dd o sJ (£6 o g)p o g(6, 



off J 



(31) 



where p(9 Q e) is the probability distribution function of 

#off. 

As implied from the mock galaxy catalog [2l| or lensing 
measurements [69i], one reasonable choice of p o e(0 o s) is a 
sum of the two-dimensional Gaussian distributions given 
by 



(32) 



In this case, we can analytically integrate Eq. (|3~Tj) to 
obtain 



KM. 



(33) 



where a s denotes the width of the Gaussian distribution 
in radian. Again the distribution of the offset angles 
has been explored by studying individual clusters based 
on both numerical simulations and observations 

[69l l7l"l - [73l |. Motivated by the results of such previous 
works, in this paper, we adopt a two-component model 
consisting of clusters whose inferred center align well with 
the true center and clusters with their measured center 
significantly offseted from the true center. Specifically, 
we adopt the following expression: 



/ecu + (1 - /ceil) exp 



(34) 

with / ccn denotes a parameter to specify the fraction of 
clusters which have their inferred center to be consis- 
tent with the true center. We employ / ccn = 0.75 + 
0.05 ln(M/M piv ) with M piv = 3 x lO 14 /!" 1 */© and 
Da(z)(j s = 0.42/i _1 Mpc as our fiducial values, which are 
roughly consistent with the results obtained from mock 
galaxy catalogs [21 1. 

Thus the 1-halo term of the shear-cluster power spec- 
trum including the halo off-centering effect can be com- 
puted by replacing Km{£) in Eq. (|2T[) with km, ofi{£) in 
Eq. (|34|) for a given model of the halo centering off- 
sets. The real-space stacked lensing profile can be also 
obtained via the transform given by Eq. (|2"2"|) . 

Fig. [HI shows the expected signals for the 1-halo term 
of the stacked lensing profile, with and without the effect 
of halo centering offset (see Fig. 4 in 21] for the similar 
plot). Here we consider lensing clusters with observed 
masses M b s > 10 14 /i _1 M© and in the redshift slice of 
0.7 < z < 0.8. The figure explicitly shows that the halo 
off-centering effect smooths out the stacked lensing sig- 
nal at small angular scales. In other words, for a fixed 
sample of clusters, a better choice of the cluster center 
can be explored to some extent by monitoring the shear 
amplitudes in order to maximize the shear amplitudes at 
small radii. At least, if several choices of the cluster cen- 
ter are available, e.g., from optical, X-ray and SZ data, 
the stacked lensing amplitudes can be compared to ob- 
tain the best inference of the cluster center for stacked 
lensing analysis. 



D. Stacked Lensing: 2-Halo Term 

Now we consider the 2-halo term contribution to the 
stacked lensing signal, which arises from the mass distri- 
bution surrounding lensing clusters. Again by employing 
the halo model formulation (e.g., ;60j), we can derive the 
2-halo term of power spectrum for the stacked lensing 
between the source galaxies and the clusters in the z-th 
redshift bin and the 6-th mass bin as 



= JdxW^ b) (z)W^z)x- 2 Pk 



K k = ~x z 



(35) 
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1. Summary: Halo Model Calculation of Stacked Lensing 
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FIG. 6: Upper panel: The angular power spectrum of stacked 
cluster weak lensing signals from main halos (1-halo term) 
with the effect of off-centering. In this example we consider 
clusters with M obs > 10 14 h~ 1 M Q located as 0.7 < z < 0.8. 
The total power spectrum is shown by the thick solid curve, 
whereas the contributions from populations with and without 
off-centering effect, as expressed by Eq. (|34l) . are shown by the 
dotted and dashed curves, respectively. The thin solid curve 
shows the total power spectrum when the true centers of all 
the clusters are properly identified (i.e., / ccn = 1). Lower 
panel: The corresponding stacked shear profiles in real space. 



In summary, using Eqs. ([2"7)l. and (|55|) . we can 

compute the halo model based prediction for the angular 
power spectrum of the stacked lensing for a given cosmo- 
logical model: 



c 



hre,2h 
(b) 



(£). 



(36) 



The corresponding stacked shear profile in real space is 
computed using Eq. (|22|) . 

Fig. [7] shows the expected stacked lensing signals in 
both Fourier and real spaces, for several different red- 
shift slices. The figure shows that the 1-halo term arising 
from the halo mass profile dominates the signal at small 
angular scales ,$,10' or multipoles greater than a few 
hundred, while the 2-halo term dominates at the smaller 
multipoles or 9 > 10'. The boxes around each curve show 
the expected la measurement errors in each angular or 
multiople bins for a Subaru HSC-type weak lensing sur- 
vey with an area coverage of 2000 deg 2 , which are com- 
puted using the covariance matrix calculations shown in 
the next section. Note that the error bars are uncorre- 
cted between different multipole bins for the power spec- 
trum measurement under the Gaussian error assumption, 
which we will throughout this paper assume, while the 
error bars are highly correlated for the real-space stacked 
shear measurement. 



IV. COVARIANCES OF OBSERVABLES AND 
FISHER MATRICES 

To estimate the accuracies of parameter estimation for 
a given survey, we employ the Fisher matrix formalism. 
In this method we first need to quantify the measure- 
ment errors of cosmological observables for given survey 
parameters that are chosen to resemble planned surveys. 
The covariance matrices of the observables can model the 
measurement errors. Then we can propagate the mea- 
surement errors into uncertainties of parameter estima- 
tion using the Fisher matrix formalism. In this section, 
we present the covariances between the observables de- 
fined in Sec. IIII1 and define the Fisher matrices of each 
observables. 



A. Number Counts 



where the weight functions T / F h (z) and W K (z) are given 
by Eqs. (fl~5|) and (fT9|) . respectively. We have set 
JdM(dn/dM)b(M)Mu M (k) — 1 at angular scales rel- 
evant for the 2-halo term; the prefactor appears in the 
formal derivation of 2-halo term based on the halo model 
formulation (see [uO|). but we used the expression above 
for notational simplicity. We have checked that this sim- 
plification does not change the following results. 



The covariance matrix of cluster number counts 
(Eq. P3]) is given by (Hlzi 

Cov(N i{b) ,N m ) = N Kb) 6fj6& + S i(W) 5%, (37) 

where 8f$ denotes the Kroncckcr delta function; 8^ — 1 

t j t j 

if i = j, otherwise 5^=0. The first term of Eq. (|3~T)) 
denotes the shot noise arising due to the finite number of 
clusters available, and is only non-vanishing when both 
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FIG. 7: The bold solid curves show stacked lensing signals in Fourier (upper-row panels) and real (lower panels) spaces, for 
three cluster redshift slices, 0.2 < z < 0.3 (left panel), 0.7 < z < 0.8 (middle), and 1.2 < z < 1.3 (right), respectively. Note 
that in all the plots we assumed a background source galaxy sample defined by z s > 1.5. The dotted curves indicate the 
contributions from 1-halo and 2-halo terms. While the effect of off-centering is included assuming the two component model 
described by Eq. (|34[) . we also show the case without any offset by the thin solid curves for reference. The boxes around each 
curve indicate the measurement errors for the binned power spectra or the binned shear profiles (see Sec. IIVB|) for the bin size 
of 0.1 dex, assuming the survey area of 2000 deg 2 . 



the redshift bins and the mass bins are same, i = j and 
b = b' . On the other hand, the second term of Eq. (|37|) 
gives the sampling variance contribution, which arises 
because the number of clusters is fluctuated according 
to the large-scale modes of large-scale structure within a 
surveyed region: 



S i{W) ee n 2 s n i{b) n i{v) JdxWl {b) {z)Wf {v) (z)x 

£d£ 
2^ 



w s (m s ) 



k = — ; z 

X 



(38) 



Here W s (£) being the Fourier transform of the survey 
window function. We assume a circular survey geometry 
with survey area Q s — irOg for simplicity, with the result- 
ing survey window function of W s (i) = 2Ji(£Q s )/ (£Q S ). 
The Kronecker delta in the second term of Eq. (|57)> 
imposes that the sampling variance is vanishing for the 



counts of different redshift slices, i.e. i j, assuming 
that the redshift slices of clusters are sufficiently wide 
such that the cluster distributions in different redshift 
slices are uncorrelated. 

Then we can write the Fisher matrix for the cluster 
number counts TV (ee iV{(&)) as 



*a0 



E aA ' 

i,J 



dp a 



(39) 



where the indices I, J run over the cluster redshift and 
mass bins (i, b) andp Q denotes a set of model parameters. 
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B. Power Spectrum Observables 

1. Covariance Matrices and Fisher Matrix 

The angular power spectra we consider in this paper as 
observables are the cluster power spectrum and the shear- 
cluster power spectrum (note that we will also study how 
the result is changed by further adding the cosmic shear 
power spectrum) . Therefore the data vector can be given 
as 



d = a 



(40) 



We need to model the auto- and cross-covariances of these 
elements. 

There are two contributions to the sampling variance 
in the power spectrum covariance, the Gaussian and non- 
Gaussian error contributions, where the later arises from 
nonlinear clustering of structure formation. In this paper 
we ignore the non-Gaussian error covariances for simplic- 
ity. As carefully studied in [75[ (also see [76[ for a simula- 
tion based study), the non-Gaussian errors are significant 
at small angular scales, although the impact on param- 
eter estimation is not large as long as a sufficient set of 
cosmological parameters is included. For example, for the 
case of cosmic shear power spectrum, the non-Gaussian 
errors degrade the marginalized error of each parameter 
only by about 10%. This also likely holds for the case 
of shear-cluster power spectrum combined with the clus- 
ter experiments, although this assumption needs to be 
carefully studied. 

Thus, assuming the Gaussian error covariance, the co- 
variance matrix of the shear-cluster power spectrum is 
given by 



Cov| 



-Itt 



8 K 



.(41) 



The spectra C KK and C^ w) are the observed power spec- 
tra including the shot noise contamination 



°i(bh') 



+ 



1 



„2 



2 il 



gal 



(42) 
(43) 



where n ga \ is the mean number density of source galax- 
ies used for the stacked lensing analysis and a e is the 
rms intrinsic ellipticity for the sum of two ellipticity 
components. For our fiducial Subaru HSC-type survey, 
n ga \ ~ 5 arcmin" 2 for source galaxies at z s > 1.5, and we 
assume a c = 0.35, corresponding to ~ 0.25 for the rms 
per component. The Kronecker delta functions 8^ and 
Slf b , in Eqs. (|4H and (|4*2"j) enforce that the shot noise does 
not contaminate the cluster power spectra between clus- 
ters in different redshift bins and mass bins. Note that 
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FIG. 8: Relative contributions to the diagonal covariance 
matrix elements of the stacked lensing signals as a function 
of multipole £, for clusters with M b s > 1O 14 /i _1 M0 and 
0.7 < 2 < 0.8. From right to left curves, we show the frac- 
tional contributions from the shot noise ("shot"), the cosmic 
shear ("cs"), and the sampling variance ("2-halo") terms. See 
text for more details. 



the shear-cluster power spectrum is not contaminated by 
shot noise. 

Eq. (l4"Tj) implies that the covariance of shear-cluster 
power spectrum consists of several contributions: (1) the 
shot noise term, which is proportional to cr 2 /(ni(f,)n g ai) 
and becomes important at high multipoles; (2) the 
cosmic shear contribution given as (l/n^^C" 1 "; (3) 
the sampling variance (2-halo) contribution given by 

C t(bb>)C KR6 ?j + C i$) C %'y Note that the contributions 
(1) and (2) are non- vanishing only if i = j (the same red- 
shift bins). In Fig. [51 we examine how each terms above 
contribute to the diagonal elements of covariance matrix 
(l4"TT) as a function of multipoles. As expected, the shot 
noise dominates at very small angles or very high multi- 
poles, while the contribution from the cosmic shear is im- 
portant over a range of angular scales from a few arcmin- 
utes to a degree scale (see also [H, [77], [ill for the cosmic 
shear contribution to cluster shear signals). Importantly 
for our method, the angular scales from a few to 10 ar- 
cminutes roughly correspond to the virial radii of massive 
clusters, and the shear signals around the viral radii are 
very sensitive to the halo mass, but rather insensitive 
to halo profile parameters such as the halo concentration 
and the halo off-centering parameter which are more sen- 
sitive to the shape of the shear profile (e.g., see [2(| for 
such a discussion based on the actual weak lensing mea- 
surements). Hence the figure implies that the accuracy 
of halo mass estimation from the stacked lensing mea- 
surement is mainly limited by the cosmic shear contam- 
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iiiation. The sampling variance (3) gives significant con- 
tributions only at very large scales, I < 10 2 (9 > 100'), 
which suggests that this term can safely be neglected in 
the shear analysis of individual clusters for which tan- 
gential shear signals can be detected out to the virial 
radii at most. At the very large scales the cosmic shear 
term dominates again; this can be understood from the 
asymptotic behaviors of these terms, (l/fii^))C KK oc £ ns 
for the cosmic shear and (C hK ) 2 oc (C hh C* KK ) oc (r=) 2 for 
the sampling variance term in the limit of £ 0, where 
n s w 1 is the power spectrum tilt. 

Similarly, the covariance matrix of cluster power spec- 
trum and the cross-covariance matrix of C hh and C hK 
are 



CovC^C^) 



4-7T 



£ — s K 

1)M 13 



Mih 



t(b&) »(&'&') 



«(66') i(b'b) 



,(44) 



and 



4^ 6%, K 
n s (2£+l)A£ ij 

/Vhh /"ih.K, I /Vhh /^fhK (a r\ 



The Fisher matrices of the power spectrum observables 
are given as 



where the indices I, J run over the redshift and mass 
bins i, b and so on. At each multipole, if N z redshift 
slices, Nm halo mass bins in each redshift slice, and Ni 
multipole bins are taken, the dimension of CS^j is N z x 

Nt x N M (N M + l)/2, and that of Cffc is N z x N e x 
Nm- The dimensions of the covariance matrix of each 
power spectrum and the cross-covariance are given by 
their squared or the product at each multipole. Under the 
Gaussian error assumption, there is no cross-covariance 
between different multipoles, and hence the summation 
in Eq. (|16} is done at each multipole. 

In computing the Fisher matrix F^§, we include the 
power spectrum information up to the maximum mul- 
tipole to 4nax = 10 4 , which roughly corresponds to 
^min ~ 1'. While the choice may be conservative in the 
sense that significant signals can be detected at I > 10 4 
(see Fig. [7]), the stacked lensing signals near the cluster 
center are affected by various systematic effects ignored 
in this paper, such as the non-linearity of shear signals, 
the contributions from central galaxies, or more gener- 
ally the effects due to baryonic physics. Our choice of 
f m ax = 10 4 is therefore intended to minimize the impact 
of these systematic effects. Note that we properly in- 
clude various nuisance parameters to model variations in 
the stacked lensing signals that can be quite important 



at ^max < 10 4 , including the halo concentration param- 
eters and the halo off-set parameters. By marginaliz- 
ing the parameter forecasts over these nuisance parame- 
ters, we have checked that the accuracies of cosmological 
parameters are not sensitive to the modes around the 
maximum multipoles, because the modes at multipoles 
£ ^ 10 3 are limited by the shot noise contamination as 
explicitly shown below. We have also checked that the 
results are not largely changed if including up to an even 
higher maximum multipole such as £ mayi = 2 x 10 4 or 
3 x 10 4 . 

On the other hand, the maximum angular scale we 
can observe should be related with the survey area fl s . 
Thus we estimate the minimum multipole £ m i n using the 
relation £ m ; n ~ it J yf^/TT. We use £ m ; n = 8 for our 
fiducial survey area of 2000 deg 2 . 

The shear-cluster and cluster-cluster power spectra 
have fairly smooth features in multipole space. Hence, 
we used the binned power spectra in multipole space to 
perform the summation in Eq. ([46]) over multipoles. We 
have checked that a finner binning of multipoles does not 
change the following results. 

For reference, the covariance matrix of the stacked 
lensing profile in real space can also be computed as in 
[79l | (also see [75| for the similar expression on the covari- 
ance of cosmic shear correlation functions): 

1 f Idl 

Cov[< 7+ > i(b) (e m ),<7+), (fe0 (^)] = n-J^ 

xj 2 (£e m )j 2 (£d n ) [c^^c^^ + 0^0^^ 

(47) 

Here we consider the covariance between angles averaged 
within an annulus between 9k,m\n and 6*fc, max and so on. 
The Bessel function averaged over such a bin is defined 
by 



6d6J 2 (£6). (48) 
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At small scale {9 <C 1) the covariance matrix should be 
dominated by the shot noise. In this limit the covariance 
matrix can be simplified as 



Cov 



2tt(0. 



2 

m,max 



9 2 

m.min 



v - _ (49) 



Eq. (f47|) shows that, even for the Gaussian error case, 
the stacked shear profile of different angles are corre- 
lated with each other in contrast with the power spec- 
trum covariance. The correlations are indeed signifi- 
cant for neighboring bins (e.g., see 80]). Thus to cor- 
rectly extract cosmological information from the real- 
space stacked shear profile, we need to properly include 
the covariances between the measured shear profiles of 
different angles, which is not straightforward compared 
to the Fourier-space based analysis (80j . 
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2. Signal-to-Noise Ratios of Stacked Lensing 

Using the power spectrum covariance we derived 
above, we here study the total signal-to-noise (S/N) ratio 
of the stacked shear measurement, integrated over angu- 
lar scales considered, as a function of cluster redshifts 
and masses. The total S/N gives an indicator on the 
power of a given observable, measured from survey of 
interest, for constraining cosmological parameters (e.g., 
[75|). The total S/N for measuring the stacked lensing 
signals of clusters in the i-th redshift bin and the 6-th 
mass bin can be simply expressed in Fourier space as 



5' 



,c hK )]- 1 c^ ) (i)- 



(50) 



Note that the total S/N in real space is similarly denned 
by using the covariance matrix (|47[) . which has the same 
amplitudes as the S/N above, because the power spec- 
trum and the stacked lensing profile contain the equiva- 
lent information content at two point level. 

Fig. IH1 shows contours of the expected S/N for a Sub- 
aru HSC-type survey of 2000 deg 2 area coverage, as a 
function of cluster masses and redshifts, assuming their 
bin widths of Az = 0.1 and A(logM b s ) = 0.2, respec- 
tively. Again notice that we assume the single population 
of source galaxies in this plot, source galaxies located at 
z s > 1.5. The figure shows that the stacked lensing sig- 
nals are detected at high significance (S/N > 10) over 
a wide range of cluster masses and redshifts. Most of 
the signal-to-noise ratio comes mainly from the small- 
angle lensing signals, where the 1-halo term (Eq. [27] ) 
is dominated. The 2-halo term or equivalently the large- 
angle signals can also be detected at marginal significance 
(S/N > 1) in most mass and redshift bins. For compar- 
ison, we also show the signal-to-noise ratio for the num- 
ber counts, which is similarly defined using Eqs. (|13p and 
([3"T)l . The figure shows that a wide-field survey allows a 
significant detection of the number counts of clusters over 
a wide range of masses and redshifts. 

The S/N estimate above allows us to argue how well 
we can measure the mean mass of clusters in each bin 
from the stacked lensing signals, which is the key in 
achieving the self-calibration of cluster mass-observable 
relation as described in Sec. HU The halo mass is sensi- 
tive to the overall amplitude of the stacked shear profile, 
with the shear amplitude at angular scales around the 
project virial radius roughly scaling with the halo mass 
as 7+ cx M (e.g., |20|). Hence the mass calibration accu- 
racy is roughly estimated as 



AM 



c?ln7 + 
d In M 



(51) 



Fig. [9]implies that the mean mass of clusters can be mea- 
sured to a few percent accuracy for halos with masses of 
a few 10 14 /i _1 Mq and over redshifts 0.3 ;$ z 1. Since 
the shear signals also depend on other parameters such 



as cosmological parameters, the concentration parame- 
ters and the off-centering parameters, the genuine power 
of the mean halo mass estimation has to be realized tak- 
ing into account marginalization over other parameters, 
as will be carefully studied below. 



C. Fisher Matrix: Combining the Cluster 
Experiment and Stacked Lensing 

In summary, the Fisher matrix for the joint experiment 
combining the cluster experiment (the number counts 
and the cluster-cluster power spectrum) and the stacked 
lensing information (the shear-cluster power spectrum) 
can be given as 



uiCC+WL pN 
t a p - t »l3 



F, 



PS 



a/3 : 



(52) 



where F N and F PS are given by Eqs. ^ and ([36) . 
respectively. In Eq. ([521 we assumed that the number 
counts and the power spectrum observables are uncorre- 
cted, where the former and the latter probe one-point 
and two-point correlation information on the underlying 
mass distribution. As carefully studied in [60( , the cross- 
covariance between the number counts and the power 
spectra arises from the three-point correlation of the mass 
distribution, and an inclusion of the cross-covariance does 
not largely degrade the accuracies of cosmological pa- 
rameter estimation as long as a sufficiently large number 
of model parameters are included. Hence, our approach 
above is a good approximation as the first step, although 
a more careful study including the full covariance contri- 
bution will be needed. 



V. RESULTS 
A. Parameters, Fiducial Model and Priors 

1. Fiducial Model and Nuisance Parameters 

We now estimate accuracies of the cosmological param- 
eter estimation given the measurement accuracies of the 
observables, using the Fisher matrix formalism. This for- 
malism assesses how well given observables can constrain 
cosmological parameters around a fiducial cosmological 
model. The parameter forecasts we obtain depend on 
the fiducial model and are also sensitive to the choice of 
free parameters. 

We include all the key parameters that may affect the 
observables within the cold dark matter and dark en- 
ergy cosmological framework. As for our fiducial cosmo- 
logical model, which is based on the WMAP seven-year 
results [55|, we include the following 9 cosmological pa- 
rameters: the density parameters of matter, baryon and 
dark energy are n M h 2 (= 0.134), Q b h 2 (= 0.0226), and 
^de(= 0.734) (note that we assume a flat universe); dark 
energy equation of state parameters are wq(= —1) and 



15 



10 



15 



I 

45 



10 



14 




10 



15 



s 

I 

45 



10 



14 




10 



15 



I 

45 



10 



14 




10 



15 



I 

45 



10 



14 




FIG. 9: The total signal-to-noise ratios for the stacked lensing signals as well as number counts of clusters, assuming the survey 
area of 2000 deg 2 and the bin sizes of Az = 0.1 and A(logM bs) = 0.2. The contours are are spaced by Alog(5/AT) = 0.25. 
We show the signal-to-noise ratios for the total stacked lensing signal C hK (Eq. [50]) (upper-left panel) and the contributions 
to it from 1-halo (upper-right) and 2-halo (lower-left) terms. The signal-to- noise ratios for the cluster number counts are also 
shown for reference (lower-right). Note that the signal-to-noise ratios scale with the survey area il„ (roughly) as oc v^7- 



Cosmological paras. Ode, fW/i 2 , fit. ft 2 , n s , a s , wo, w a , 6^, /nl 

Source redshift z m (= 3zo) Eq. fl7)l 

Concentration parameter: c(M,z) A v - a , B V [ T , C vu Eq. (|26[l 

Halo off-centering paras: / ccn , a s f CC n(M, z) = / ccn ,o + p C en,M ln(M/M piv ) + p CC n,z ln(l + z) Eq. ((34j) 

a s (M,z) = <t s ,o +Pa„M ln(M/M piv ) +p CTs , z m(l + z) 
Mass-obs. relation: lnM b iaa,oinAf lnMbias = lnM b ,o + ELi 9W [ln(M/M piv )f + ^Li Ec L- © 
Q-inM = vinMfi + QzmmA / M piv )Y + ELi ^i„ M ,jZ' 

TABLE I: Parameters included in our Fisher matrix analysis. We include 9 cosmological parameters as well as 24 nuisance 
parameters to quantify the impact of possible residual systematic errors on the cosmological parameter estimation. In total we 
include 33 parameters. The fiducial values of each parameters are given in the text. 



w a (— 0); the primordial power spectrum parameters are 
the spectrum tilt n s (— 0.963), the running spectral index 
a s (= 0), and the normalization parameter of primordial 
curvature perturbation, S<^(= 4.89 x 10~ 5 ), where the val- 
ues in the parentheses denote the fiducial parameter val- 



ues. As an extension of dark energy cosmological model 
we also include the primordial non-Gaussianity which is 
modeled by the local-type parameter /nl(= 0). To be 
more specific we adopt the model pi [58j in computing 
the halo mass function and the halo bias. 
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In addition, we include a number of nuisance param- 
eters to model systematic errors involved in the ob- 
servables. For the mass-observable relation, which is 
parametrized by In A/bias and a\ n m > we allow more com- 
plicated dependences on redshift and halo mass by adopt- 
ing the following form: 

lnM bias (M,z) = lnM b , 

3 3 

+ Qb,i [\n(M/M piv )} % + s h<i z\ (53) 

1=1 2 = 1 

&ln Al{M, z) = (TlnM,0 

3 3 

z\ (54) 

i=i i=i 

where we choose M piv = 3 x 10 14 h" 1 M ( 7 )7 and lnM^o, 
qi, Si, ... are free parameters. The form above has been 
adopted by previous work (e.g., [1JJ). The fiducial values 
for these parameters are cri n A/.o — 0.3, and the other 
parameters such as lnM^o, qh,i and Sb,i are set to zero. 

The source redshift uncertainty affecting the stacked 
lensing signals is parametrized by the mean redshift pa- 
rameter z m (= 1) in the source redshift distribution 
(see Eq. [17]). The systematic errors in the small-scale 
stacked shear profile are the halo concentration param- 
eter parametrized by three parameters, A V i r , £> v ; r and 
C v i r (see Eq. below for their fiducial values), and 
the halo off-centering effect given by the two parameters 
/ccn and a s (Eq. [M] ) . Similarly, to include possible mass 
and redshift dependences for the offset parameters, which 
are poorly known theoretically, we employ a simple form 
given by 

/ ccn (M, Z) = f ce n,0 + Pcen,M m(Af/M piv ) 
T" Pccn,z 

ln(l + z), (55) 
a s {M, z) = a St0 + P<TstM m(M/M piv ) 

+p CTs , z ln(l + z). (56) 

Note that the nuisance parameters for a s are in units 
of /i _1 Mpc. The fiducial values are f ccn — 0.75 + 
0.051n(M/M piv ), i.e., / con , - 0.75, p ccnM = 0.05 and 
Pcen,z = 0; er S! o = 0A2h- 1 Mpc/D A (z), Pa„M = and 

Pa s ,z = 0. 

Thus we include 33 parameters in the Fisher matrix 
analysis; 9 cosmological parameters and 24 nuisance pa- 
rameters (see Table fl] for a brief summary) . Unless oth- 
erwise specified, we do not add any prior on the nuisance 
parameters. 

We also need to specify survey parameters. Thus far 
we have employed the survey parameters that resemble 
the planned Subaru HSC survey (see Eq. [17] )■ For a 
broader application of the method proposed in this paper 
we also consider other survey parameters which roughly 
resemble other imaging surveys being planned, DES and 
LSST. Table [TT1 summarizes the assumed survey parame- 
ters. We adopt the method in [75[ (see around Eq. [20] of 
the paper) to estimate the mean redshift and the avail- 
able number density of galaxies for each survey. 



For all surveys, we employ the same halo mass thresh- 
old M obs > 10 14 /i _1 M Q , which makes it easier to com- 
pare the results between different surveys. We consider 
5 mass bins for each redshift slice, with the bin width of 
A(logA/ bs) = 0.2 (for the highest mass bin, we extend 
the upper limit of the mass to infinity). The maximum 
cluster redshift is set to z max = 1.4, which is approxi- 
mately the maximum cluster redshift detectable via op- 
tical imaging surveys including y-band data, except for 
DES for which we adopt z m ax = 1 given the shallower 
depth. The cluster redshift bin width is Az = 0.1 for all 
the surveys. For the source galaxy population needed for 
the stacked lensing we use galaxies at z s > 1.5 for HSC 
and LSST, and at z s > 1.1 for DES, respectively. Assum- 
ing the depth of each survey we can compute the avail- 
able number density of source galaxies, which is needed 
to quantify the intrinsic ellipticity noise. We simply as- 
sume a e = 0.35 for the rms intrinsic ellipticities (sum of 
two components) for all the surveys. 



2. Priors 

As stated above, unless explicitly expressed, we do not 
use any priors on the 24 nuisance parameters. Therefore, 
we can address how combining the cluster experiments 
and the stacked lensing allows to self-calibrate these nui- 
sance parameters. 

However, using probes of large-scale structure alone 
is not powerful enough to constrain all the cosmologi- 
cal parameters simultaneously. Rather, combining the 
large-scale structure probes with constraints from Cos- 
mic Microwave Background (CMB) temperature and po- 
larization anisotropics significantly helps to lift parame- 
ter degeneracies. In this paper we use the CMB priors 
expected from the Planck satellite mission. When com- 
puting the Fisher matrix for the CMB we employ 9 pa- 
rameters in total, 8 from our fiducial cosmological param- 
eter set (we do not include any constraints on /nl from 
CMB in order to examine the pure power of clusters and 
weak lensing in constraining /nl) plus the Thomson scat- 
tering depth to the last scattering surface (r = 0.089). 
We used the publicly- available CAMB code [8l|, based 
on the CMB FAST 82], to compute the angular power 
spectra of temperature anisotropy, -E-mode polarization 
and their cross-correlation. Note that we ignored B- 
mode spectra arising from the primordial gravitational 
wave. The details of our CMB Fisher matrix calculation 
can be found from 60] . The Fisher matrix for the joint 
experiments combining the CMB information with the 
method studied in this paper is simply given by adding 
the CMB Fisher matrix to the matrix in Eq. ([52]) . i.e., 
F = F CMB + F N + F PS . The Thomson scattering depth 
t is marginalized over before the CMB Fisher matrix is 
added to other constraints. 

By the time when the imaging surveys above come 
online, stringent cosmological constraints will be avail- 
able from the ongoing Baryon Oscillation Spectroscopic 
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HSC (1500deg 2 ) 
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1.0 
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5.2 


1.5 


10 14 < M obs 


5 


0.1 < ^ < 1.4 


13 


8 < t < 10 4 


DES 
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0.7 


10 
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1.1 


10 14 < M obs 


5 


0.1 < z < 1.0 


9 


5 < t < 10 4 
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20000 


1.2 


50 
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1.5 


10 14 < M obs 


5 


0.1 < 2 < 1.4 


13 
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TABLE II: The summary of survey parameters adopted in this paper. The parameters Nm and N z denote the numbers of 
cluster mass and redshift bins, respectively. Note that the survey parameter set of HSC (2000deg 2 ) is same as the fiducial 
parameter set used in previous sections. The parameters ngai^ot and n ga i denote the number densities of source galaxies at all 
redshifts and at redshift z s > z 3 , m i n , respectively, where the latter is used for the weak lensing analysis in this paper. 



Method ct^de) c(wo) cr(w a ) FoM z v - lv <t(/nl) ctQuQm h 2 ) a(\nfl^h 2 ) a(n s ) a(a s ) a(\n8^) 

Planck 0.0102 0.0086 0.0226 0.0068 0.0174 

+N 0.099 0.62 2.17 1.4 0.32 0.0101 0.0085 0.0218 0.0066 0.0169 

+N + C hh 0.035 0.37 1.13 7.5 0.46 15.6 0.0098 0.0079 0.0201 0.0061 0.0158 

+C hK 0.056 0.44 1.12 5.1 0.56 86.9 0.0099 0.0080 0.0205 0.0063 0.0161 

+N + C hh + C hK 0.023 0.22 0.60 28.6 0.56 14.5 0.0059 0.0068 0.0189 0.0058 0.0143 

+N + C hh + C hK - + C KK 0.022 0.22 0.59 32.4 0.57 14.3 0.0053 0.0065 0.0124 0.0038 0.0104 

+N + C hh + C hK +BAO 0.017 0.18 0.48 52.7 0.58 14.2 0.0041 0.0066 0.0188 0.0058 0.0141 



+N + C hh + C hK + C KK +BAO 0.017 0.18 0.47 54.8 0.58 14.0 0.0040 0.0063 0.0123 0.0038 0.0103 



TABLE III: Expected marginalized errors (68% C.L.) on each cosmological parameter for different combinations of cosmological 
probes: the column labeled as "Planck" shows the errors expected from the CMB information alone; the following columns are 
the results expected by further adding the cluster number counts (AT), the cluster-cluster power spectra (C hh ), the shear-cluster 
spectra (C hK ), the cosmic shear spectra (C KK ), and the geometrical constraints expected from the BOSS BAO information 
(see text for details). The errors include marginalization over other parameters including nuisance parameters (see Tab. [TJ> . 
However note that we did not employ any priors on the nuisance parameters. The columns labeled as "FoM" and "z p i v " denote 
the dark energy figure of merit and the pivot redshift, respectively (see text for details). We assume the survey parameters for 
HSC (2000 deg 2 ) in Tab. ITT1 for the cluster counts and the power spectrum measurements (C hh ,C ,hre ). Note that z p - lv denotes 
the pivot redshift at which the dark energy equation of state is best constrained. The element labeled as " — " denotes that the 
parameter is not well constrained by the observables. 

Survey (BOSS) (lflj, which gives a geometrical probe 
of cosmological distances via the baryon acoustic oscilla- 
tion (BAO) experiment [13] ■ Using the method in f&jj . 
we compute the Fisher matrix expected from the BOSS 
experiment. To be more precise, we will add, to the 
Fisher matrix, the expected geometrical information on 
the Hubble expansion rates and the angular diameter dis- 
tances up to the redshifts, z ~ 0.65, which are relevant 
for QMh 2 , f^DE, Wq, w a in our parameters. 

Finally we will also study how the cosmological con- 
straints can be improved by adding another power spec- 
trum available from the same imaging survey data, the 
cosmic shear power spectrum C KK for the same source 
galaxies. Since the cluster-cluster, shear-cluster and cos- 
mic shear power spectra at large angles depend on the 
halo bias bh in different ways, roughly C hh oc b 2 h P m , 
C hK oc bhP m and C KK oc P m , combining all the spectra 
may help to break the degeneracies between the halo bias 
and other parameters further. However, note that the 
self-calibration of nuisance parameters can be achieved 
only if focusing on the single source galaxy population for 
lensing. Thus, we include the cosmic shear power spec- 
trum without tomography. For the cosmic shear C KK , we 



include the spectrum only up to ^ max = 10 3 as the cosmic 
shear at small scales is subject to various uncertainties. 
We take proper account of the cross-covariance of C KK 
with C hK and C hh when adding constraints from C KK . 



B. Parameter Forecasts 

Table HTT1 summarizes the 68% error on each cosmologi- 
cal parameter, marginalized over other parameter uncer- 
tainties including nuisance parameters in Table HI Here 
we have assumed survey parameters for the HSC sur- 
vey of 2000 deg 2 coverage given in Table |Hj The first 
row shows the constraints expected from the Planck 
data alone, while the other rows show the constraints 
by further adding the large-scale structure probes. It is 
shown that the parameter constraints can be continu- 
ously improved by adding the different observables, the 
cluster counts, the cluster-cluster power spectrum, the 
stacked lensing, the cosmic shear power spectrum, and 
the BAO experiment. We should again stress that we 
did not add any priors on nuisance parameters in Ta- 
ble [H and therefore the parameter constraints are as a 
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FIG. 10: Projected 68% confidence limit (A^ 2 = 2.3) constraints in various parameter spaces; Qoe-wo plane (upper-left panel), 
wo-Wa plane (upper-right), f2DE-/NL plane (lower-left), and z m -a\ nM ,o plane (lower-right). The survey area is assumed to 
2000 deg 2 . The different curves indicate constraints from different measurements; the number counts plus the cluster-cluster 
correlation function (dotted), the 1-halo (dot-dashed) and 2-halo (short dashed) terms of stacked lensing signals and their sum 
(long dashed), and the total constraints from the combination of all measurements (solid). The Planck prior is added in all 



result of the self-calibration of those systematic effects 
attained by combining the different probes. The pre- 
vious works d, [ill l38 - i40j have studied the combina- 
tion of cluster number counts and cluster-cluster correla- 
tion functions (so-called "self-calibrated cluster counts"), 
which is essentially represented by the column denoted 
by "+iV + C hh " . We find that an additional calibration 
of cluster masses from the stacked weak lensing signals 
significantly improves constraints on each of the dark en- 
ergy parameters, by a factor of 2 or so, even when vari- 
ous uncertainties associated with stacked lensing signals, 
such as the source redshift uncertainty, off-centering, and 
concentration parameters, are marginalized over. The re- 
sulting constraints on dark energy equation of state is in 
fact quite comparable to the constraints expected from 
tomographic cosmic shear without any marginalization 
over systematic errors (75j . 

A useful parameter that quantifies the power of a given 



survey for constraining dark energy equation of state is 
so-called the Figure of Merit (FoM) denned in [l| : 



a{w a )a(w p ) v /dct(Cov[w ,Wa])' 

where w p is dark energy equation of state at the "pivot" 
redshift, at which the dark energy equation of state is 
best constrained [l|. The FoM is inversely proportional 
to the area of the error ellipse in the wo-w a plane. Thus, 
the larger the FoM is, constraints on dark energy mod- 
els are tighter. Table Mil shows that adding the stacked 
lensing signals improves the FoM by a factor of ~ 4 com- 
pared with the cluster experiment alone (number counts 
plus the cluster-cluster power spectrum) in our fiducial 
setting. This result appears to be consistent with a re- 
cent similar analysis by [§5j], although no systematic er- 
ror associated with the stacked lensing analysis has been 
considered in their Fisher matrix analysis. 
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FIG. 11: Marginalized errors on lnMbias {upper panel) and 
o"inAf (lower), the parameters in the mass-observable rela- 
tion (see Eq. [9)), as a function of cluster redshifts, con- 
strained from the combination of all the observables (except 
for the cosmic shear and BAO). The errors are shown for 
three different cluster masses, M = lQ 1 * h^M® [solid curve), 
3 x lO^/T^M© {dotted), and lO ls ft _1 M {dashed). 



It would be also informative to study how a different 
combination of the observables can break the parameter 
degeneracies. Fig. [TU] shows projected 68% error ellipses 
in various two-parameter sub-spaces of the parameters. 
The dotted and long-dashed curves show the error el- 
lipses expected from the cluster experiments (the number 
counts plus the cluster-cluster power spectrum) and the 
stacked lensing, respectively. For comparison the dot- 
dashed and short-dashed curves show the results from 
each of the small-angle (1-halo term) and large-angle (2- 
halo term) signals of the stacked lensing, respectively. 
The innermost solid curve is the ellipse by combining all 
the measurements (except for the BAO and the cosmic 
shear power spectrum), showing that the ellipse signif- 
icantly shrinks compared to any of the ellipses of one 
particular observable. This improvement of parameter 
constraints is due to the self-calibration of nuisance pa- 
rameters, and the lower-right panels explicitly shows one 
example: the mean source redshift is directly estimated 
from the observables (self-calibrated), without any prior, 
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FIG. 12: The improvements of the marginalized constraints in 
the wo-w a plane by adding the cosmic shear power spectrum 
(C KK ') for the same source galaxy population as that used in 
the stacked lensing analysis, and the BAO geometrical infor- 
mation expected from the SDSS-III BOSS survey. 



to an accuracy of <r{z m ) <~ 0.1 or so. The accuracy cor- 
responds to cr(7+) ~ 0.01 — 0.02 for the cluster redshift 
z ~ 0.5 — 1 (see also Fig. [3]). Finally our results indicate 
that primordial non-Gaussianity can be constrained to 
c(/nl) ~ 15, which is mainly from the cluster observ- 
ables (see also [9l. ITl|). 

To see how well cluster masses are calibrated by the 
combination of stacked weak lensing and the cluster ex- 
periments, in Fig 1111 we show the marginalized 68% er- 
ror on the parameters in the mass-observable relation, 
lnMbias and <J\ n M, as a function of cluster masses and 
redshifts, based on Eqs. (153)) and ([53)1. We find that the 
halo mass bias lnMbias is constrained to a(lnMbias) ~ 
0.07 for a wide range of cluster masses and redshifts, in- 
dicating that cluster masses are determined at ~ 7% ac- 
curacy, even the other parameters are marginalized over. 
The mass calibrations of more massive clusters are less 
accurate, simply because of the smaller number of clus- 
ters available for the stacked lensing analysis (see also 
Fig.©. 

Fig. [12] shows how the dark energy constraints, wq and 
W a , can be further improved by adding the cosmic shear 
power spectrum of the same source galaxy population 
{z s > 1.5) and the BOSS BAO information. The figure 
shows that the BOSS BAO information is useful to im- 
prove the dark energy constraints. More quantitatively, 
the marginalized errors on wq and w a are improved, from 
a{w ) = 0.22 and a{w a ) = 0.60, to a(w ) = 0.18 and 
o~(w a ) = 0.47 by adding both the cosmic shear power 
spectrum and the BOSS BAO information. The FoM 
value is improved by a factor of 1.9, from 28.6 to 54.8 
(see also Table HTT|) . For comparison, the BOSS BAO 
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FIG. 13: The Figure of Merit (FoM) from the combination 
of the stacked lensing, the number counts, and the cluster- 
cluster correlation function, as a function of the maximum 
cluster redshift jz max - The filled squares indicate the case that 
the minimum source redshift is fixed to z s , m in = 1-5, whereas 
the case that the minimum source redshift is set to z s , m i n = 
•Zmax + 0.1, is shown by the filled triangles. 



information alone (with the Planck prior) has the FoM 
value of 19.2. Thus the BAO information appears to be 
highly complementary to the constraints from the imag- 
ing survey. 

The stacked weak lensing has been thought to provide 
a powerful method to constrain the concentration-mass 
relation (e.g., (86[). However, in our Fisher matrix analy- 
sis, the marginalized errors on the concentration param- 
eters, cr(A vir ) = 3.9, a(B vir ) = 0.09, and er(C vir ) = 0.3, 
are not so impressive. The reason is that these param- 
eters are strongly degenerate with each other, and also 
with off-centering parameters (/ CO n and a s ). Our result 
therefore may suggest a critical difficulty in measuring 
the concentration-mass relation from the stacked weak 
lensing technique. Rather the information can be effi- 
ciently extracted by combining the weak and strong lens- 
ing information on individual cluster basis (e.g., |87H89| ). 

To understand the sensitivity of cluster redshifts to the 
dark energy constraints, we study how the dark energy 
constraints change with the maximum cluster redshift. 
Fig. [13] shows the FoM as a function of the maximum 
cluster redshift z max used for the analysis, for the same 
bin width of Az = 0.1. We find that FoM changes rapidly 
with the maximum cluster redshifts particularly below 
1, indicating the importance of deep cluster surveys to 
include clusters out to z > 1. We do gain by adding 
clusters at z > 1 if we fix the minimum source redshift 
Zs.min to z s , m j n = 1.5. However, the improvement in the 
FoM at z > 1 becomes rather small if we set z s ^\ n = 
z max + 0.1, because the additional information from high 
redshift clusters can be to some extent compensated by 
the change of the number density of background source 



Effect of Priors 



Thus far we adopted no prior on the nuisance param- 
eters. However, in practice, we expect that there are 
external constraints on some of these parameters. For 
instance, photometric redshifts, which are required to de- 
fine the source galaxy sample in the first place, naturally 
provide some prior information on z m . The fraction and 
offset distributions of off-centering galaxies may be in- 
ferred from mock galaxy catalogs in TV-body simulations 
[2l| or directly from a detailed measurement of individ- 
ual clusters [69( . For cluster observables we may be able 
to further refine the mass-observable relations by using 
detailed studies of individual clusters or combining dif- 
ferent information from optical, lensing, X-ray and SZ 
measurements. Here we explore how such priors help to 
improve the parameter constraints. 

The result is shown in Fig. [TJ] As specific exam- 
ples, we add priors on the mean redshift of the source 
galaxy distribution, z m , on the standard deviation of the 
cluster offsets, a s , on the halo mass bias of the cluster 
mass-observable relation, InAfbias, and on the mass scat- 
ter cinM- We find that the prior a{z m ) < 0.1 indeed 
improves constraints on dark energy, although the im- 
provement of the FoM is only up to about 10%. This 
again implies that the self-calibration of the source red- 
shift is achieved by combining the different observables, 
well enough not to degrade our cosmological results. We, 
however, comment that a marginally accurate photo- z in- 
formation is needed for selecting source galaxies for our 
stacked lensing analysis. An imperfect photo- z informa- 
tion causes a contamination from foreground or cluster 
member galaxies to the source galaxy population, which 
dilutes the stacked lensing signals. We will come back 
to this issue later. On the other hands, Fig. 1 141 indicates 
that priors on cr s help improve the FoM. The result, in 
turn, confirms the previous claims that the off-centering 
is one of the biggest uncertainties inherent to the stacked 
weak lensing analysis plj . 

The priors on the mass-observable relations have a 
greater impact on the FoM, because the dark energy 
constraints are mainly from cluster observables in the 
method studied in this paper. A few percent level prior 
on In A/bias or <J\ n M, if available, can improve the FoM 
by a factor of 2 and 1.3, respectively. The result suggests 
the importance of detailed studies of individual clusters 
for accurately calibrating the mass-observable relation, 
as has been done in 12a, KK| • 



D. Predictions for Various Surveys 

Next we tune survey parameters to several future wide- 
field optical imaging surveys, in which the stacked clus- 
ter lensing analysis studied in this paper is feasible, to 
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prior on D a (z)ct s [h"'Mpc] prior on <7 lnM 

FIG. 14: The Figure of Merit (FoM) as a function of priors on various nuisance parameters to model the systematic errors: the 
mean source redshift z m (upper-left panel), the cluster off-centering parameter a s (lower-left; in units of /i _1 Mpc), the mass 
bias parameter of the cluster-mass observable relation InMbias (upper-right), and its mass scatter ai n M (lower-right). For the 
parameters a s , InMbias and cr\ n M, we add the priors on parameters in Table U) the overall amplitude parameter, the halo mass 
dependence, the redshift dependence or all the parameters. 



study/compare their powers as a dark energy probe. 
Specifically, we consider the following surveys: HSC, 
DES and LSST. 

The marginalized errors on dark energy parameters 
and /nl are summarized in Table IIV1 and the pro- 
jected error ellipses in the wo-w a plane are presented in 
Fig. [13 The difference of FoM between HSC of 1500 and 
2000 deg 2 , and LSST surveys can be understood by their 
differences in the survey area. The FoM roughly scales 
with the survey area as FoM oc f2 s , and the difference 
between the FoM values of HSC and LSST can approxi- 
mately be explained by this relation. On the other hand, 
the FoM from DES is rather small despite its large area 
coverage of 5000 deg 2 . This is because DES is shallower 
than HSC and LSST, thereby yielding a narrower win- 
dow of the redshift sensitivity to dark energy parame- 
ters due to the smaller number density of source galaxies 
and the lower maximum cluster redshift (see Table [TTJ) . 



In all the surveys, the FoM is improved by a factor of 
2 — 4 by adding the stacked weak lensing signals (see 
also Fig. I15p . suggesting the powerful complementarity 
of stacked weak lensing to cluster cosmology. Table [V] 
summarize improvements of constraints in each survey 
by adding the cosmic shear measurement for the same 
source sample and the BOSS BAO information. 

VI. DISCUSSIONS ON OTHER SYSTEM ATICS 

While our analysis properly takes account of a number 
of key systematic effects, such as the mass-observable re- 
lation, the mean source redshift for weak lensing analysis, 
and the offset of cluster centers, there are still a number 
of potential systematics that could affect our results. In 
this section, we briefly discuss these effects. 

Perhaps one of the most important effects ignored in 
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N+C h 



N+C hh +C hK 



Survey o-(flpE) ct(wq) g(m a ) FoM ct(/nl) cx(^de) q-(wq) c(w a ) FoM <t(/nl) g(OpE) ff(wo) FoM <j(/nl) 



HSC (2000deg 2 ) 
HSC (1500deg 2 ) 

DES 

LSST 



0.035 0.37 1.13 7.5 15.6 0.056 0.44 1.12 5.1 86.9 

0.040 0.42 1.26 6.0 18.0 0.062 0.48 1.24 4.1 99.9 

0.026 0.27 0.82 14.0 10.9 0.053 0.52 1.39 4.7 66.3 

0.011 0.12 0.38 59.4 2.3 0.016 0.12 0.29 54.0 13.0 



0.023 0.22 0.60 28.6 14.5 

0.026 0.26 0.69 21.9 16.8 

0.020 0.22 0.62 30.7 10.3 

0.007 0.06 0.16 322.2 2.2 



TABLE IV: Forecasts of cosmological parameters estimated from the Fisher matrix analysis. We compare three upcoming 
surveys, HSC, DES and LSST. All errors are 68% errors, after marginalizing over the other parameters, expected from the 
combination of the cluster number counts (N), the cluster power spectra (C hh ), and the stacked weak lensing signals (C hK ). 
No prior is added on nuisance parameters. The Planck prior is added to all cases. 
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FIG. 15: Marginalized constraints in the wo-w a plane for three different future surveys, HSC (upper-left and upper-right panels), 
DES (lower-left) , and LSST (lower-right) . The different curves are as in Fig. 1101 i.e., the dotted curves denote constraints from 
the cluster number counts and the cluster-cluster correlation functions, the dashed curves are constraints from the stacked weak 
lensing (both 1- and 2-halo terms), and the solid curves show the combined constraints. While the Planck prior is added to all 
the ellipses, no prior is added to nuisance parameters. 



this paper is the error on cluster redshifts. In the future 
imaging surveys as considered above, cluster redshifts are 
thought to be measured using the photometric redshift 
technique. [9l| studied how the photo-z errors of clusters 
affect the cluster number counts, and then argued that 
photometric redshifts have to be accurate at < 1% level 



in order not to degrade dark energy constraints signifi- 
cantly In fact this accuracy is almost reachable in future 
surveys, as we can use several tens of member galax- 
ies to determine cluster redshifts accurately (e.g., [92|). 
We should comment that, if a redshift survey of Lumi- 
nous Red Galaxies (LRGs) is available from the imaging 
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+C KK +BAO +C KK +BAO 



Survey u(Ode) a(w ) a(w a ) FoM ct(/nl) t(^de) cr(wo) g(Wg) FoM ct(/nl) Q-(^de) cr(wo) cr(wg) FoM ct(/nl) 

HSC (2000dcg 2 ) 0.022 0.22 0.59 32.4 14.3 0.017 0.18 0.48 52.7 14.2 0.017 0.18 0.47 54.8 14.0 

HSC (1500deg 2 ) 0.025 0.25 0.67 24.8 16.5 0.018 0.20 0.52 45.0 16.3 0.018 0.19 0.51 46.7 16.1 

DES 0.019 0.21 0.60 36.3 10.1 0.016 0.18 0.48 53.4 10.0 0.015 0.17 0.48 56.4 9.9 

LSST 0.006 0.06 0.16 348.1 2.2 0.006 0.06 0.16 338.9 2.2 0.006 0.06 0.16 361.2 2.2 

TABLE V: The improvements of cosmological constraints by adding extra constraints from the cosmic shear measurements 
(C KK ) using the same source galaxy sample as used in the stacked weak lensing analysis and from the baryon acoustic oscillation 
observations expected from the SDSS-III BOSS (BAO). 



survey region, such as the SDSS survey, the LRG cata- 
log can be used to construct a secure catalog of clusters 
with spectroscopic redshifts. The LRGs are very likely 
to be one of the brightest cluster galaxies, and are rel- 
atively easy to measure their spectroscopic redshifts as 
they are bright. Thus the LRG catalog is enormously 
helpful in constructing a homogeneous catalog of clus- 
ters, by searching for member galaxies surrounding each 
LRG with deeper imaging survey. This is indeed the case 
for HSC, as the target region of HSC is overlapped with 
the survey region of the SDSS-III BOSS. 

An imperfect photometric redshift estimate from lim- 
ited color information may result in a leakage of fore- 
ground galaxies to the source galaxy sample. This con- 
tamination not only dilutes the shear signals particularly 
near the cluster centers, but also complicates the self- 
calibration of the mean source redshift because such con- 
tamination cannot be described by a single parameter. 
This is a potentially important effect which should be 
studied carefully using mock catalogs or simulations. 

Here we briefly estimate the possible contamination of 
foreground galaxies, based on the result of [35| which de- 
veloped a clipping method to reduce the effect of photo- 
z outliers in weak lensing analysis. Assuming a Subaru 
HSC-type survey with the g'r'i'z'y' filter set, the esti- 
mated contamination rate for the fiducial source galaxy 
population defined by z s > 1.5 is ~ 0.6, if no clipping is 
applied. However the contamination rate is reduced to 
~ 0.2 if 70% of galaxies are clipped. The resulting source 
galaxy number density of n ga i ~ 4 arcmin" 2 (z s > 1.5) 
after clipping is in fact similar to the value adopted in the 
paper, n ga i ~ 5.2 arcmin , because of our conservative 
choice of the source galaxy distributions. Although the 
estimated contamination rate is still large, we can fur- 
ther reduce the contamination by adding near-infrared 
images or by decreasing the minimum redshift. For in- 
stance, if we add JHK-b&nd images that can be delivered 
from the VISTA Kilo-degree Infrared Galaxy (VIKING) 
survey, the contamination rate becomes < 3%, which 
should be small enough not to have a significant impact 
on our result. We can achieve similar contamination rate 
of < 3% by decreasing the minimum source redshift from 
z s ,min = 1-5 to ~ 1.1, which has been shown to have 
a small impact in terms of the dark energy FoM (see 
Fig. [T5]) . A more comprehensive analysis of the effect of 



photo-z outliers on this technique is a future work. 

One may consider that the photo-z outliers to clus- 
ter redshifts can be to some extent monitored by using 
an angular cross-correlation method of the source galaxy 
population with objects with spectroscopic redshifts or 
secure photo-z estimates, such as cluster themselves or 
spectroscopic LRG samples (e.g., [H, [93[ for a study 
of the cross-correlation method). However, the cross- 
correlation method to eliminate the photo-z outliers may 
be limited by the magnification bias, as discussed be- 
low. First, lensing due to the foreground mass distri- 
bution causes the area of a given patch on the sky to 
increase, and thus diluting the number density of back- 
ground galaxies. Second, galaxies fainter than the lim- 
iting magnitude can be magnified to cross the thresh- 
old. Therefore, if the magnitude cut is used in selecting 
source galaxies in addition to their colors, the magni- 
fication bias may cause the observed densities of back- 
ground galaxies to be increased or decreased, depending 
on the slope of the galaxy number counts. This mag- 
nification bias causes an apparent correlation between 
the densities of background galaxies and the cluster dis- 
tribution (e.g., [83| ), and hence degrades the power of 
the cross-correlation method. Here we give a rough 
estimate on the effect. The apparent cross-correlation 
between clusters of redshift slice z; and the source 
galaxy distribution is given as (n ga i(z s )ro c i(z/)) oc (2 — 
5s)(Az/H( Zl ))W K (zi)b h (M)U(zi) (see Eq. HHJ>, where 
£ m is the mass correlation function of the cluster red- 
shift Zi and s is the slope of the galaxy number count 
as a function of the galaxy magnitude to, i.e., s = 
dlnn ga i/dlnm. On the other hand, a cross-correlation 
between (the outliers of) source galaxies and the clus- 
ter distribution is given as ( i n^° to ~ z (z s )n c \(zi) X j oc 

fautiiex(zi)Azbh(M)bg£ m (zi), where /outlier (z;) is the frac- 
tion of photo-z outliers leaking into the cluster redshift 
z;. Hence the ratio of these cross-correlations can be 
found to be O(0.1)// ou tiicr for z; ~ 0.5, assuming b g ~ 1 
and the magnitude slope (2 — 5s) ~ 1 as implied from 
observations [87| . Thus the cross-correlation method can 
eliminate photo-z outliers if the outliers contribute to the 
source galaxy catalog by more than 10% in a given clus- 
ter redshift slice. This rough estimate suggests that the 
cross-correlation method may not be so useful to per- 
fectly eliminate the photo-z outliers. Even if this is the 
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FIG. 16: The FoM as a function of the prior on the multi- 
plicative shear error m 7 , which is a newly included nuisance 
parameter in the analysis. See text for more details. 



case, the shear and the magnification bias arise from the 
same mass distribution around the clusters, and there- 
fore this magnification bias effect on the source galaxy 
population may be iteratively calibrated by combining 
the measurements of the stacked shear profile and the 
cross-correlation function. More optimistically, we may 
be able to improve the mass calibration with staked lens- 
ing by adding information from the magnification bias 
(e.g., HI). 

We note that the intrinsic alignments of galaxy ellip- 
ticities, which are a significant source of systematic errors 
for cosmic shear measurements (e.g., [6l|), are not impor- 
tant for our analysis. This is because the stacked lensing 
is linear in shear, and does not include any correlation 
between the shapes of different galaxies. 

The shape measurement of faint galaxies involves sev- 
eral uncertainties as extensively studied in the literature 
(e.g., |96|). For the cosmic shear measured from upcom- 
ing wide-field surveys, the uncertainties such as the addi- 
tive bias and multiplicative errors in the shear estimates 
need to be calibrated to better than 1% or 0.1% in shear 
in order not to si gnifi cantly degrade cosmological param- 
eter estimations [3J|. Since our method focuses on the 
single population of source galaxies, all clusters share the 
same systematic errors in the shear calibrations, even if 
they exist, in such a way that they are uncorrelated with 
the distribution of foreground clusters. In this sense, 
this effect is similar to the source redshift uncertainty, 
especially for the multiplicative shear error. Thus we 
expect that the shape measurement errors can similarly 
be self-calibrated to the comparable accuracy to that of 
the source redshift uncertainty by using our method. To 
check this point, we repeat the Fisher matrix analysis for 
our fiducial Subaru HSC-type survey including the multi- 



plicative shear error m 7 = <y mcasurt y ,y truc _ i as additional 
nuisance parameter (i.e., 34 parameters in total). With- 
out any prior on m 7 , we find the FoM to be 27.5, which is 
only slightly smaller than the original FoM of 28.6. This 
indicates that the multiplicative shear error does not sig- 
nificantly degrade our cosmological result. Furthermore, 
the marginalized error on the multiplicative shear error 
is cr(m 7 ) = 0.04, which is the level already achieved by 
the current shear measurement techniques (9?| • Fig. [IB] 
shows the FoM as a function of the prior on to 7 . 

In this paper we assume the Gaussian statistics in com- 
puting the covariance of stacked weak lensing signal. The 
assumption is valid for the 1-halo term because one of the 
points is always fixed to the cluster center. On the other 
hand, the 2-halo term dominates only at very large scales, 
I < 10 2 (see Fig. [7]), where the effect of the non-Gaussian 
error is small. Thus we expect the effect of non-Gaussian 
errors is small for our results. We also ignored the cross- 
covariance between the cluster number counts and the 
cluster power spectra (the 2-halo term and cluster-cluster 
angular power spectra). [60( has shown that the cross- 
correlation coefficients peak around £ ~ 10 3 , whereas in 
our calculations the information from the power spectra 
comes mainly from I ~ 10 2 (see Figs. [TJandlH]). Therefore, 
the effect of the cross-covariance should also be small in 
our case. 

Our results are based on the flat-sky and Limber's ap- 
proximations. While the full calculation without relying 
on these approximations is not difficult [62j . their effects 
on our results, i.e, forecast constraints on cosmological 
parameters, are expected to be small, because we are in- 
terested in sensitivities of the signals, rather than signals 
themselves, on cosmological parameters. An exception 
is constraints on the primordial non-Gaussianity, /nl, 
as the constraints come mostly from very large scales 
(£ ~ 0(1)) where these approximations obviously break 
down. Therefore, our results on /nl should be taken with 
caution. 

Also important are the effects of theoretical uncertain- 
ties, such as the inaccuracy of the halo mass function, the 
halo bias (including the assembly bias and the stochas- 
ticity), and the radial profile. Previous work 98, 99] has 
found that theoretical uncertainties of the mass function 
and the halo bias can indeed be important. Such un- 
certainties might also be self-calibrated to some extent 
from the combination of observables studied in this pa- 
per, but the detailed exploration is beyond the scope of 
this paper. 

There are several additional systematics in cluster 
weak lensing studies, including the effects of central 
galaxies, non-weak shear and magnification bias. Some 
of these effects on stacked cluster weak lensing have been 
investigated by [S5| . Again, our conservative choice of 
f ragx = 10 4 is meant to reduce these effects, but in prac- 
tical analysis more careful treatments will be needed to 
minimize such effects, as done recently by (37j . 
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VII. CONCLUSION 

In this paper, we have estimated accuracies on cos- 
mological parameters derivable from a joint experiment 
of the cluster observables (number counts and cluster- 
cluster correlation functions) and stacked weak lensing 
signals of distant galaxy shapes produced by the clus- 
ters, both of which can be drawn from a wide-field opti- 
cal imaging survey. A striking advantage of this method 
is the derived cosmological constraints become insensi- 
tive to various systematic errors such as the source red- 
shift uncertainty and the cluster mass-observable rela- 
tion. Our key assumption to overcome these systematic 
uncertainties is to focus on a single population of back- 
ground source galaxies to extract lensing signals around 
cluster at different rcdshifts. By cross-correlating the 
source galaxy shapes with the distribution of clusters 
with known redshifts (or at least secure photo-z esti- 
mates), we can statistically separate out the contribu- 
tion at a given redshift slice from the total shear signals 
of the source galaxies. Put another way, this method en- 
ables a tomographic reconstruction of lensing structures. 
Moreover, the systematic errors of the cluster observ- 
ables and the stacked lensing are closely related with each 
other, which enables an efficient self-calibration of these 
systematics (see Fig. [T] for the conceptual flow-chart). 
Specifically, the stacked lensing provides direct measure- 
ments of the mean mass of clusters, and therefore allows 
to calibrate the cluster mass-observable relation, whereas 
we can calibrate the source redshift uncertainty via the 
relative strengths of the stacked lensing signals at differ- 
ent cluster redshifts, because the stacked lensing signals 
at different cluster redshifts share the same source red- 
shift uncertainty (only if the single population of back- 
ground source galaxies is used, as assumed in this paper). 
As a result, cosmological parameters including dark en- 
ergy parameters are well constrained yet least affected by 
these significant systematic effects (see Tables ITlII and ITVl 
and Figs. QUI [H and . 

To estimate the cosmological power of the method 
above, we have developed a formulation to compute all 
the observables based on the halo model, and also derived 
the power spectra of the relevant correlation functions. 
In particular we have shown that the halo off-centering 
effect on the stacked lensing signals is expressed by a 
simple analytic form in Fourier space (see Appendix [A"| . 
Moreover, the Fourier-space analysis greatly simplifies 
the Fisher matrix calculations, because the power spec- 
trum covariances have the simpler forms than in real 
space. However, note that, even if the real-space ob- 
servables such as the angular correlation functions are 
used as done in the previous measurements, the cosmo- 
logical power should be equivalent to what was shown 
in this paper, because the two-point correlations and the 
power spectra contain the equivalent information at two- 
point level. We believe that the formulation developed in 
this paper is useful in further extending the method, e.g., 
including additional power spectrum information such 



as the SZ power spectrum available from the upcom- 
ing wide-field SZ surveys with an overlap with optical 
imaging surveys, such as the HSC and the ACT or the 
DES and the SPT. The additional information can fur- 
ther improve the cosmological constraints, although new 
systematic errors introduced by new observations should 
carefully be studied. This is our future project, and will 
be presented elsewhere. 

The results shown in this paper can be compared with 
the previous work studying the self-calibration technique 
for cluster observables, where the number counts and 
the cluster clustering information are combined to self- 
calibrate the mass-observable relation (e.g., [38]). We 
have found that adding the stacked lensing signals sig- 
nificantly improves the marginalized errors on each cos- 
mological parameter by up to a factor of ~ 2, espe- 
cially the dark energy FoM by a factor of ~ 4, which 
is achieved without assuming any priors on a number of 
nuisance parameters to model various systematic errors, 
including the mass-observable relation, the mean source 
galaxy redshift, offsets of cluster centers, and concen- 
tration parameters. The expected constraints on dark 
energy are quite comparable to those obtained from to- 
mographic cosmic shear measurements obtained without 
including any systematic errors. We have shown that an 
accurate self-calibration of these systematic errors is in- 
deed attained, with the mean source redshift calibrated 
to o~(z m ) ~ 0.1 and the mean cluster mass in each bin 
to er(ln Mbias) ~ 0.05 - 0.1 (see Fig. QJJ. We have also 
confirmed that clustering of massive clusters is powerful 
in constraining primordial non-Gaussianity. The robust 
constraint of <t(/nl) 10 can be obtained if the joint 
experiment is applied to upcoming imaging surveys. 

In summary, we have demonstrated that weak lens- 
ing information and cluster observables, both available 
from the same imaging survey data, are complementary 
to each other. The cosmological power is greatly im- 
proved and an efficient self-calibration of various system- 
atic uncertainties can be attained when the two measure- 
ments are combined. This is particularly promising given 
that there are several planned wide-field optical imaging 
surveys overlapping with SZ surveys: Subaru HSC sur- 
vey and ACT, the DES and the SPT, and also the all-sky 
Planck CMB survey which will be available in a few years. 
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FIG. 17: Effect of off-centering on radial profiles of a single cluster with M = 10 15 Mq and z = 0.5, computed using the 
Fourier space description derived in Eq. (|A3|1 . From left to right panels, we show the convergence profile, the tangential shear 
profile, and the Fourier transform of the convergence. Solid lines show the case with no offset (0 o g = 0), whereas dashed and 
dotted lines indicate profiles with the offset o g = 0.1' and 1', respectively. 



Symbol 


Definition 


Eq. 




Linear mass power spectrum 






Nonlinear mass power spectrum 




M 


Halo mass 




M ohs 


Cluster mass observable 




ClnM 


Variance in mass-observable rel. ([9]) 


M bias 


Halo mass bias 


© 




The i-th redshift bin of clusters 


cp 


Mb t mi n , M(, >m j n 


The 6-th mass bin of clusters 


CP 


n i(b), Ni(b) 


Cluster number counts 


CP, CP 


dn/dM 


Halo mass function 


CP 


S i{h) {M) 


Selection function of clusters 


CP 




Cluster power spectrum 


CP 


Wf {b) {M) 


Weight function of clusters 


CP 


b h (M) 


Halo bias 


CP 


W K {z) 


Lensing weight function 


CP 


C KK (£) 


Shear power spectrum 


Co]) 


<7+> iW (0) 


Tangential shear profile 


CP 




Shear-cluster power spectrum 


106) 




1-halo term of C hK (£) 


(27J 




2-halo term of C hK (£) 


psi 


u M {k) 


Halo mass profile 




Hm (£) 


Halo convergence profile 


([281) 


KM,os(£) 


With halo centering offset 


CP 


c(M, z) 


Halo concentration 


(1211) 



TABLE VI: Symbols used in the paper. 



Appendix A: Derivation of Off-centered Lensing 
Signals 

We consider an azimuthally averaged convergence pro- 
file, whose true center is located at 0q, at distance 8 from 
the coordinate origin. It is expressed as 

«off(0) = / ^-<e' - B ) 

J 27T 

= J d 2 0' k{0' -0 o )Wg{0'), (Al) 



where W$(8 ) is defined by 
W e (B') 



2?r 



nH\B'\ 



(A2) 



The Fourier transform in angular space is given by 
~K M , oS (l) = J d 2 0e- M J d 2 e' K (0' - B )Wg(9'), 



d 2 0'Jn(£d') 



d 2 ^ 

2?r 



(A3) 



where o g = \0q\, and used the integral representation of 
the Bcsscl function 



J (x) -- 
and the closure equation 



o 2^ 



xdx Jo(ax) Jo(bx) = —S(a — b). 

a 



(A4) 



(A5) 
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Therefore the effect of off-centering is simply expressed as 
the multiplication of the zeroth order Bessel function in 
Fourier space. Note that Eq. (|A3|) is general expression 
applicable to any convergence profile k. For reference, 
adopting the flat sky approximation convergence profiles 
k(9) and tangential shear profile 7+(#) with the effect of 
the offset are given by 

f £d£ 

«W = / T7-«M,offWJo(^), (A6) 



f £d£ 

7+(0) = / ^ZmMQMM)- (A7) 



In Fig. [TTJ we show the effect of off-centering on con- 
vergence and tangential shear profiles. Basically the ef- 
fect is such that it flattens the convergence profile, and 
eliminates the tangential shear signals, below the offset 
size. 



Appendix B: Summary of Symbols 

In Tabic I VII we give a summary of symbols commonly 
used in the paper. 
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